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(57) Abstract: Nature evolves biological 
molecules such as proteins through iterated 
rounds of diversification, selection, and 
amplification. The present invention pro- 
vides methods, compositions, and systems 
for syntheiszing, selecting, amplifying, and 
evolving non-natural molecules based on 
nucleic acid templates. The sequence of a 
nucleic acid template is used to direct the 
synthesis of non-naturaJ molecules such 
as unnatural polymers and small molecules. 
Using this method combinatorial libraries 
of these molecules can be prepared and 
screened. Upon selection of a molecule, 
its encoding nucleic acid template may 
be amplified and/or evolved to yield the 
same molecule of the present invention 
allow for the amplification and evolution of 



non-natural molecules in a manner analogous to the amplification of natural biopolymer such as polynucleotides and protein. 
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Evolving New Molecular Function 



PRioiaTy Information 
[0001] This application claims priority under 35 U.S.C- § 119(e) to U-S. Provisional patent 
applications 60/277,081, filed March 19, 2001, entitled ''Nucleic Acid Directed Synthesis of 
Chemical Compounds'':. 60/277,094, filed March 19, 2001, entitled "Approaches to Generating 
New Molecular Function"; and 60/306,691, filed July 20, 2001, entitled "Approaches to 
Generating New Molecular Function", and the entire contents of each of these applications are 
hereby incorporated by reference. 

Background of the Invention 
[0002]^ The classic "chemical approach" to generating molecules with new functions has been 
used extensively over the last century in applications ranging firom drag discovery to synthetic 
methodology to materials science. In this approach (Fig. 1, black), researchers synthesize or 
isolate candidate molecules, assay these candidates for desired properties, determine the 
structures of active compounds if unknown, formulate structure-activity relationships based on 
the assay and structural data, and then synthesize a new generation of molecules designed to 
possess improved properties. While combinatorial chemistry methods {see, for example, A. V. 
Eliseev and J. M. Lehn. Combinatorial Chemistry In Biology 1999, 243, 159-172; K. W. Kuntz, 
M. L. Snapper and A. H. Hoveyda. Current Opinion in Chemical Biology 1999, J, 313-319; D. 
R. Liu and P. G. Schultz. Angew. Chem. ML Ed Eng. 1999, 38, 36) have increased the 
throughput of this approach, its fundamental limitations remain unchanged. Several factors limit 
the effectiveness of the chemical approach to generating molecular function. First, our ability to 
accurately predict the structural changes that will lead to new function is often inadequate due to 
subtle conformational rearrangements of molecules, unforeseen solvent interactions, or unknown 
stereochemical requirements of binding or reaction events. The resulting complexity of 
stracture-activity relationships firequently limits the success of rational ligand or catalyst design, 
including those efforts conducted in a high-throu^put manner. Second, the need to assay or 
screen, rather than select, each member of a collection of candidates limits the number of 
molecules that can be searched in each experiment. FiQally, the lack of a way to amplify 
synthetic molecules places requirements on the minimum amount of material that must be 



1 of 88 



wo 02/074929 




PCT/US02/08546 



produced for characterization, screening, and structure elucidation. As a result, it can be diJBBcult 
to generate libraries of more than roughly 10^ different synthetic compoimds. 
[0003] In contrast. Nature generates proteins with new functions using a fundamentally 
different method that overcomes many of these limitations. In this approach (Fig. 1, gray), a 
protein with desired properties induces the survival and amplification of tiie information 
encoding that protein. This information is diversified through spontaneous mutation and DNA 
recombination, and then translated into a new generation of candidate proteins using the 
ribosome. The power of this process is well appreciated (see, F. Arnold Acc. Chem. Res. 1998, 
57, 125; F.H. Arnold etaL Cum Opin. Chem, Biol 1999, 5, 54-59; J. MinshuU et al Curr, Opin, 
Chem. Biol 1999, J, 284-90) and is evidenced by the fact that proteins and nucleic acids 
dominate the solutions to many complex chemical problems despite their limited chemical 
fimctionality. Clearly, unlike the linear chemical approach described above, the steps used by 
Nature form a cycle of molecular evolution. Proteins emerging from this process have been 
directly selected, rather than simply screened, for desired activities. Because the information 
encoding evolving proteins CDNA) can be amplified, a single protein molecule with desired 
activity can in theory lead to the survival and propagation of the DNA encoding its structure. 
The vanishiiigly small amounts of material needed to participate in a cycle of molecular 
evolution allow Ubraries much larger in diversity than those synthesized by chemical approaches 
to be generated, and selected for desired function in small volumes. 

[0004] Acknowledging the power and eflSciency of Nature's approach, researchers have used 
molecxilar evolution to generate many proteins and nucleic acids with novel binding or catalytic 
properties (see, for example, J. MinshuU et al Curr. Opin, Chem, Biol 1999, 5, 284-90; C. 
Schmidt-Dannert et al Trends Biotechnol 1999, 77, 135-6; D. S. Wilson et al Annu. Rev. 
Biochem. 1999, 68, 611-47). Proteins and nucleic acids evolved by researchers have 
demonstrated value as research tools, diagnostics, industrial reagents, and therapeutics and have 
greatiy expanded our understanding of the molecular interactions that endow proteins and 
nucleic acids with binding or catalytic properties {see, M. Famulok et al Curr. Opin. Chem. Biol 
1998, 2, 320-7). 

[0005] Despite nature's efficient approach, to generatirig fimction, nature's molecular 
evolution is limited to two types of "natural" molecules — proteins and nucleic acids— because 
thus far the information in DNA can only be translated into proteins or into other nucleic acids. 
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However, many synthetic molecules of interest do not in general represent nucleic acid 
backbones, and the use of DNA-templated synthesis to translate DNA sequences into synthetic 
small molecules would be broadly useful only if synthetic molecules other than nucleic acids and 
nucleic acid analogs could be synthesized in a DNA-templated fashion. An ideal approach to 
generating functional molecules would merge the most powerful aspects of molecular evolution 
with the flexibility of synthetic chemistry. Clearly, enabling the evolution of non-natural 
synthetic small molecules and polymers, similarly to the way nature evolves biomolecules, 
would lead to much more effective methods of discovering new synthetic ligands, receptors, and 
catalysts difficult or impossible to generate using rational design. 

Summary of the Invention 
[0006] The recognition of the need to be able to amplify and evolve classes of molecules 
besides nucleic acids and proteins led to the present invention providing methods and 
compositions for the template-directed synthesis, amplification, and evolution of molecules. In 
general, these methods use an evolvable template to direct the synthesis of a chemical compound 
or library of chemical compounds (ie., the template actually encodes the synthesis of a chemical 
compound). Based on a library encoded and synthesized using a template such as a nucleic acid, 
methods are provided for amplifying, evolving, and screening the library. In certain 
embodiments of special interest, the cheniiical compounds are compoimds that are not, or do not 
resemble, nucleic acids or analogs thereof. In certain embodiments, the chemical compounds of 
these template-encoded combinatorial libraries are polymers and more preferably are unnatural 
polymers excluding natural peptides, proteins, and polynucleotides). In other embodiments, 
the chemical compounds are small molecides. 

[0007] In certain embodiments, the method of synthesizing a compound or library of 
compounds comprises first providing one or more nucleic acid templates, which one or more 
nucleic acid templates optionally have a reactive unit associated therewith. The nucleic acid 
template is then contacted with one or more transfer units designed to have a first moiety, an 
anti-codon, which hybridizes to a sequence of the nucleic acid, and is associated with a second 
moiety, a reactive unit, which includes a building block of the compound to be synthesized. 
Once these transfer xmits have hybridized to the nucleic acid template in a sequence-specific 
manner, the synthesis of the chemical compoimd can take place due to the interaction of reactive 
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moieties present on the transfer units and/or the nucleic acid ten:iplate. Signficantly, the 
"se<JSeHce"of"the"^fi^ 

attached compound and thereby its structure. It will be appreciated that the method described 
herem may be used to synthesize one molecule at a tune or may be used to synthesize thousands 
to millions of compounds using combinatorial methods. 

[0008] It will be appreciated that libraries synthesized in this manner (i.e., having been 
encoded by a nucleic acid) have the advantage of being amplifiable and evolvable. Once a 
molecule is identified, its nucleic acid template besides acting as a tag used to identify the 
attached compoxmd can also be amplified using standard DNA techniques such as the 
polymerase chain reaction (PGR). The amplified nucleic acid can then be used to synthesize 
more of the desired compound. In certain embodiments, during the amplification step mutations 
are introduced into the nucleic acid in order to generate a population of chemical compounds that 
are related to the parent compoxmd but are modified at one or more sites. The mutated nucleic 
acids can then be used to synthesize a new library of related compoimds. In this way, the library 
being screened can be evolved to contain more compounds with the desired activity or to contain 
compounds with a higher degree of activity. 

[0009] The methods of the present invention may be used to synthesize a wide variety of 
chemical compounds. In certain embodiments, the methods are used to synthesize and evolve 
unnatural polymers (ie,, excluding polynucleotides and peptides), which cannot be amplified 
and evolved using standard techniques currently available. In certain other embodiments, the 
inventive methods and compositions are utilized for the synthesis of small molecules that are not 
typically polymeric. In still other embodiments, the method is utilized for the g^nerateion of 
non-natural nucleic acid polymers. 

[0010] The present invention also provides the transfer molecules (e.g., nucleic acid 
templates and/or transfer units) usefiil in the practice of the inventive methods. These transfer 
molecules typically include a portion capable of hybridizing to a sequence of nucleic acid and a 
second portion with monomers, other building blocks, or reactants to be incorporated into the 
final compound being synthesized. It will be appreciated that the two portions of the transfer 
molecule are preferably associated with each other either directly or through a Linker moiety. It 
will also be appreciated that the reactive unit and the anti-codon may be present in the same 
molecule (e.g., a non-natural nucleotide having functionality incorporated therein). 
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[0011] The present invention also provides kits and compositions useful in the practice of the 
lnwMve"^eQi 'rffese^ldts^may~incluae~m^^ 

monomers, solvents, buffers, enzymes, reagents for PGR, nucleotides, small molecule scaffolds, 
etc. The kit may be used in the synthesis of a particular ^e of unnatural polymer or small 
molecule. 

Definitions 

[0012] The term antibody refers to an immunoglobulin, whether natural or wholly or 
partially synthetically produced. All derivatives thereof which maintain specific binding ability 
are also included in the term. The term also covers any protein having a binding domain which 
is homologous or largely homologous to an immunoglobtdin binding domain. These proteins 
may be derived from natural sources, or partly or wholly synthetically produced. An antibody 
may be monoclonal or polyclonal. The antibody may be a member of any immimoglobulin 
class, including any of the human classes: IgG, IgM, IgA, IgD, and IgE. Derivatives of the IgG 
class, however, are preferred in the present invention. 

[0013] The term, associated withy is used to describe the interaction between of among two 
or more groups, moieties, compoimds, monomers, etc. When two or more entities are 
'^associated with" one another as described herein, they are linked by a direct or indirect covalent 
or non-covalent interaction. Preferably, the association is covalent. The covalent association 
may be through an amide, ester, carbon-carbon, disulfide, carbamate, ether, or carbonate linkage. 
The covalent association may also include a linker moiety such as a photocleavable linker. 
Desirable non-covalent interactions include hydrogen bonding, van der Waals interactions, 
hydrophobic interactions, magnetic interactions, electrostatic interactions, etc. Also, two or more 
entities or agents may be "associated" with one another by being present together in the same 
composition. 

[0014] A biological macromolecule is a polynucleotide (e.g., RNA, DNA, RNA/DNA 
hybrid), protein, peptide, lipid, natural product, or polysaccharide. The biological 
77iacromolecule may be naturally occurring or non-naturally occurring. In a preferred 
embodiment, a biological macromolecule has a molecular weight greater than 500 g/mol. 
[0015] Polynucleotide, nucleic acid, or oligonucleotide refers to a polymer of nucleotides. 
The polymer may include natural nucleosides (i.e., adenosine, thymidine, guanosine, cytidine. 
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uridine, deoxyadenosine, deoxythymidine, deoxyguanosine, and deoxycytidine), nucleoside 
"aiMogs — ^('e:g:/ ~2-aniinoadtmosiBer~"2-^^ 

adenosine, 5-methylcytidine, C5-bromouridine, C5-fluorouridine, C5-iodonridine, 
C5-propynyl-uridine, C5-propynyl-cytidine, C5-methylcytidine, 7-deazaadenosine, 
7-deazaguanosine, 8-oxoadenosine, 8-oxoguanosine, 0(6)-methylguanine, and 2-thioc3didine), 
chemically modified bases, biologically modified bases (e.g., methylated bases), intercalated 
bases, modified sugars (e.g., 2 -fluororibose, ribose, 2'"deoxyribose, arabinose, and hexose), or 
modified phosphate groups (e.g., phosphorothioates and 5* -N-phosphoramidite linkages). 
[0016] A protein comprises a polymer of amino acid residues linked together by peptide 
bonds. The term, as used herein, refers to proteins, polypeptides, and peptide of any size, 
structure, or function. Typically, a protein will be at least three amino acids long. A protein may 
refer to an individual protein or a collection of proteins. A protein may refer to a full-length 
protein or a fragment of a protein. Inventive proteins preferably contain only natural amino 
acids, although non-natural amino acids (he., compounds that do not occur in nature but that can 
be incorporated into a polypeptide chain; see, for example, 
http://wwwxcoxaltech.edu/-dadgip/Unimtstruct.gif, vMch displays stmctures of non-natural 
amino acids that have been successfully incorporated into functional ion channels) and/or amino 
acid analogs as are known in the art may alternatively be employed. Also, one or more of the 
amino acids in an inventive protein may be modified, for example, by the addition of a chemical 
entity such as a carbohydrate group, a hydroxyl group, a phosphate group, a famesyl group, an 
isofamesyl group, a fatty acid group, a linker for conjugation, fimctionalization, or other 
modification, etc. A protein may also be a single molecule or may be a multi-molecular 
complex. Aproteinmay be just a fiagment of a naturally occurring protein or pq)tide. A protein 
may be naturally occurring, recombinant, or synthetic, or any combination of these. 
[0017] The term small molecule, as used herein, refers to a non-peptidic, non-oligomeric 
organic compound either synthesized in the laboratory or found in nature. Small molecules, as 
used herein, can refer to compounds that are "natural product-like", however, the term "small 
molecule" is not limited to "natural product-like" compounds. Rather, a small molecule is 
typically characterized in that it possesses one or more of the following characteristics including 
having several carbon-carbon bonds, having multiple stereocenters, having multiple functional 
groups, having at least two different types of functional groups, and having a molecular weight 
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of less than 1500, although this characterizatioii is not intended to be limiting for the purposes of 
IKe preseiit"invmfion: ' 

[0018] The term small molecule scaffold, as used herein, refers to a chemical compound 
having at least one site for fimctionalization. In a preferred embodiment, the small molecule 
scaffold may have a multitude of sites for fimctionalization- These fimctionalization sites may 
be protected or masked as would be appreciated by one of skill in this art. The sites may also be 
found on an underlying ring structure or backbone. 

[0019] The term transfer unit, as used herem, refers to a molecule comprising an anti-codon 
moiety associated with a reactive xinit, including, but not limited to a building block, monomer, 
monomer unit, or reactant used ia synthesizing the nucleic acid-encoded molecules. 

Description of the Figures 
[0020] Figure 1 depicts natme's approach (gray) and the classical chemical approach (black) 
to generating molecular function. 

[0021] Figure 2 depicts certain DNA-templated reactions for nucleic acids and analogs 
thereof. 

[0022] Figure 3 depicts the general method for synthesizing a polymer using nucleic acid- 
templated synthesis. 

[0023] Figure 4 shows a quadruplet and triplet non-frameshifling codon set. Each set 
provides nine possible codons. 

[0024] Figure 5 shows methods of screening a library for bond-cleavage and bond-formation 
catalysts. These methods take advantage of streptavidin's natural affinity for biotin. 
[0025] Figure 6A depicts the synthesis directed by hairpin (H) and end-of-helix (E) DNA 
templates. Reactions were analyzed by denaturing PAGE after the indicated reaction times. 
Lanes 3 and 4 contained templates quenched with excess P-mercaptoethanol prior to reaction. . 
[0026] Figure 6B depicts matched (M) or mismatched (X) reagents linked to thiols (S) or 
primary amines (N) were mixed with 1 equiv of template fimctionalized with the variety of 
electrophiles shown. Reactions with thiol reagents were conducted at pH 7.5 under the following 
conditions: SLAB and SBAP: 37°C, 16 h; SIA: 25°C, 16 h, SMCC, GMBS, BMPS, SVSB: 
25°C, 10 min. Reactions with amine reagents were conducted at 25°C, pH 8.5 for 75 minutes. 
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[0027] Figure 7 depicts (a) H templates linked to a-iodoacetamide group which were reacted 
"witirtKiolTeageStslcon^ 

at the indicated temperature for 16 h. Calculated reagent Tm: 38°C (matched), 28°C (single 
mismatch). 

[0028] Figure 8 depicts a reaction performed using a 41 -base E template and a 10-base 
reagent designed to aimeal 1-30 bases from the 5' end of the template. The kinetic profiles in the 
graph show the average of two trials (deviations < 10%). The "n = 1 mis" reagent contains three 
mismatches. 

[0029] Figure 9 depicts the repeated n = 10 reaction in Figure 8 in which the nine bases 
following the 5*-NH2-dT were replaced with the backbone analogues shown. Five equivalents 
of a DNA oligonucleotide complementary to the intervening bases were added to the "DNA + 
clamp'* reaction. Reagents were matched (0) or contained three mismatches (3). The gel shows 
reactions at 25°C after 25 min. 

[0030] Figure 10 depicts the n = 1, n = 10, and n = 1 mismatched (mis) reactions described 
in Figure 8 which were repeated with template and reagent concentrations of 12.5, 25, 62.5 or 
125 nM. 

[0031] Figure . 11 depicts a model translation, selection and amplification of synthetic 
molecules tiiat bind streptavidin fi-om a DNA-encoded library. 

[0032] Figure 12 depicts (a) Lanes 1 and 5: PCT: amplified library before streptavidin 
binding selection. Lanes 2 and 6: PGR amplified library after selection. Lanes 3 and 7: PGR 
amplified authentic biotin-encoding template. Lane 4: 20 bp ladder. Lanes 5-7 were digested 
with Tsp45I. DNA sequencing traces of the amplified templates before and after selection are 
also shown, together with the sequences of the non-biotin encoding and biotin-encoding 
templates, (b) General scheme for the creation and evolution of libraries of non-natural 
molecules using DNA-templated synthesis, where -Ri represents the library of product 
functionality transferred from reagent Ubrary 1 and -Rib represents a selected product. 
[0033] Figure 13 depicts exemplary DNA-templated reactions. For all reactions under the 
specified conditions, product yields of reactions with matched template and reagent sequences 
were greater than 20-fold higher than that of control reactions with scrambled reagent sequences. 
Reactions were conducted at 25®C with one equivalent each of template and reagent at 60 nM 
final concentration unless otherwise specified. Conditions: a) 3 mM NaBHsCN, 0.1 M MES 
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buffer pH 6.0, 0.5 M NaCl, 1 .5 h; b) 0.1 M TAPS buffer pH 8.5, 300 mM NaCl, 12 h; c) 0.1 M 

e) 12O11MI9, 1.4nMNa2PdCU,0.5MTSraOAcbufferpH5.0,18h;f) Prenux Na2PdCl4 with 
two equivalents of P(p-S03C6H4)3 in water 15 min., then add to reactants in 0.5 M NaOAc buffer 
pH 5.0, 75 mM NaCl, 2 h (final [Pd] = 0.3 mM, [19] = 120 nM). The olefin geometry of 
products firom 13 and the regiochemistries of cycloaddition products firom 14 and 16 are 
presumed but not verified. 

[0034] Figure 14 depicts analysis by denaturing polyacrylamide gel electrophoresis of 
representative DNA-templated reactions listed in Figures 13 and 15. The structures of reagents 
and templates correspond to the nimibering in Figures 13 and 15. Lanes 1, 3, 5, 7, 9, 11: 
reaction of matched (complementary) reagents and templates under conditions listed in Figures 
13 and 15 (Ibe reaction of 4 and 6 was mediated by DMT-MM). Lanes 2, 4, 6, 8, 10, 12: 
reaction of mismatched (non-complementary) reagents and templates under conditions identical 
to those in lanes 1, 3, 5^ 7, 9 and 11, respectively. 

10035] Figure 15 depicts DNA-templated amide bond formation mediated by EDC and 
sxilfo-NHS or by DMT-MM for a variety of substituted carboxylic acids and amines. In each 
row, yields of DMT-MM-mediated reactions between reagents and, templates complementary in 
sequence are followed by yields ofEDC and sulfo-NHS-mediated reactions. Conditions: 60 nM 
template, 120 nM reagent, 50 mM DMT-MM m 0.1 M MOPS buffer pH 7.0, 1 M NaCl, 16 h, 
25^C; or 60 nM template, 120 nM reagent, 20 mM EDC, 15 mM suIfo-NHS, 0.1 M MES buffer 
pH 6.0, 1 M NaCl, 16 h, 25*=*C. In all cases, control reactions with mismatched reagent 
sequences yielded little or no detectable product. 

[0036] Figure 16 depicts (a) Conceptual model for distance-independent DNA-templated 
synthesis. As the distance between the reactive groups of an annealed reagent and template (n) is 
increased, the rate of bond formation is presrmaed to decrease. For those values of n in which the 
rate of bond formation is significantly higher than the rate of template-reagent annealing, the rate 
of product formation remains constant. In this regime, the DNA-templated reaction shows 
distance independence, (b) Denaturing polyacrylamide gel electrophoresis of a DNA-templated 
Wittig olefination between complementary 11 and 13 with either zero bases (lanes 1-3) or ten 
bases (lanes 4-6) separating annealed reactants. Although the apparent second order rate 
constants for the n = 0 and n = 10 reactions differ by three-fold (kapp (n = 0) = 9.9 x IO3 M"^s"^ 
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wbile kapp (n = 10) = 3.5 x 10^ IVT^s'^), product yields after 13 h at both distances are nearly 
"quaatitofiverXaS^ 
(not shown). 

[0037] Figure 17 depicts certain exemplary DNA-templated complexity building reactions. 
[0038] Figure 18 depicts certain exemplary linkers for use in the method of the invention. 
[0039] Figure 19 depicts certain additional exemplary linkers for use in the method of the 
invention. 

[0040] Figure 20 depicts an -exemplary thioester linker for use in the 'method of the 
invention. 

[0041] Figure 21 depicts DNA-templated amide bond formation reactions in which reagents 
and templates are complexed with dimethyldidodecylammonivim cations. 

[0042] Figure 22 depicts the assembly of transfer units along the nucleic acid template and 
polymerization of the nucleotide anti-codon moieties. 

[0043] Figure 23 depicts the polymerization of the dicarbamate units along the nucleic acid 
template to form a polycarbamate. To initiate polymerization the "stailf ' monomer ending in a o- 
nitrobenzylcarbamate is photodeprotected to reveal the primary amine that initiates carbamate 
polymerization. Polymerization then proceeds in the 5' to 3' direction along the DNA backbone, 
with each nucleophilic attack resulting^in the subsequent unmasking of a new amine nucleophile. 
Attack of the "stop" monomer liberates an acetamide rather than an anome, thereby terminating 
polymerization. 

[0044] Figure 24 depicts cleavage of the polycarbamate from the nucleotide backbone. 
Desilylation of the enol ether linker attaching the anti-codon moiety to the monomer \mit and the 
elimination of phosphate driven by the resulting release of phenol provides the provides the 
polycarbamate covalently linked at its carboxy terminus to its encoding single-stranded DNA. 
[0045] Figure 25 depicts components of an amplifiable, evolvable functionalized peptide 
nucleic acid library. 

[0046] Figure 26 depicts test reagents used to optimize reagents and conditions for DNA- 
templated PNA coupling. 

[0047] Figure 27 depicts a simple set of PNA monomers derived from commercially 
available building blocks useftil for evolving a PNA-based fluorescent Ni^"*" sensor. 
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[0048] Figure 28 depicts two schemes for the selection of a biotin-tenninated functionalized 

— PNA-capable of cataly^g-an-aldol or-retrGaldolTeaetion^^ - - 

[0049] Figure 29 depicts DNA-template-dkected synthesis of a combinatorial small 
molecule library. 

[0050] Figure 30 shows schematically how DNA-linlced small molecule scaffolds can be 
functionalized sequence-specfiically by reaction with synthetic reagents linked to 
complementary nucleic acid oligonucleotides, this process can be repeated to complete the 
synthetic transfomiations leading to a fiilly functionalized molectile. 

[0051] Figure 31 shows the fimctionalization of a cephalosporin small molecule scaffold 
with various reactants. 

[0052] Figure 32 depicts a way of measuring the rate of reaction between a jBxed 
nucleophile and an electrophile hybridized at varying distances along a nucleic acid template to 
define an essential reaction window in which nucleic acid-templated synthesis of nonpolymeric 
structures can take place. 

[0053] Figure 33 depicts three linker strategies for DNA-templated synthesis. In the 
autocleaving linker strategy, the bond coimecting the product firom the reagent oligonucleotide is 
cleaved as a natural consequence of the reaction. In the scarless and usefiil scar linker strategies, 
this bond is cleaved following the DNA-templated reaction. The depicted reactions were 
analyzed by denaturing polyacrylamide gel electrophoresis (below). Lanes 1-3 were visualized 
using UV light without DNA staining; lanes 4-10 were visualized by staining with ethidiimi 
bromide following by UV transillumination. Conditions: 1 to 3: one equivalent each of reagent 
and template, 0.1 M TAPS buffer pH 8.5, 1 M NaCl, 25 ^'C, 1.5 h; 4 to 6: tiiree equivalents of 4, 
0.1 M MES buflEer pH 7.0, 1 M NaNOa, 10 mM AgNOa, 37 °C, 8 h; 8 to 9: 0.1 M CAPS bufifer 
pH 11.8, 60 mM BME, 37 °C, 2 h; 11 to 12: 50 mM aqueous NaI04, 25 °C, 2 h. Ri = 
NH(CH2)2NH-dansyl; Ri = biotin. 

[0054] Figure 34 depicts strategies for purifying products of DNA-templated synthesis. 
Using biotinylated reagent oligonucleotides, products arising jBrom an autocleaving linker are 
partially purified by washing the crude reaction witli avidm-linked beads (top). Products 
generated from DNA-templated reactions using the scarless or usefial scar linkers can be purified 
by using biotinylated reagent oligonucleotides, capturing crude reaction products with avidin- 
linked beads, and eluting desired products by inducing linker cleavage (bottom). 
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[0055] Figure 35 depicts the generation of an initial template pool for an exemplary library 
synthesis. 

[0056] Figure 36 depicts the DNA-templated synthesis of a non-natural peptide library. 
[0057] Figure 37 depicts a 5 '-reagent DNA-linker-amino acid. 

[0058] Figure 38 depicts the DNA-templated synthesis of an evolvable diversity oriented 
bicyclic library. 

[0059] Figure 39 depicts DNA-templated multi-step tripeptide synthesis. Each DNA- 
templated amide formation used reagents containing the sulfone linker described in the text. 
Conditions: step 1: activate two equivalents 13 in 20 mM EDC, 15 mM sxilfo-NHS, 0.1 M MES 
buffer pH 5.5, 1 M NaCl, 10 min, 25 °C, then add to template in 0.1 M MOPS pH 7.5, IM NaCl, 
25°C, 1 h; steps 2 and 3: two equivalents of reagent, 50 mM DMT-MM, 0.1 M MOPS buffer pH 
7.0, 1 M NaCl, 6 h, 25 °C. Desired product after each step was purified by capturing on avidin- 
linked beads and eluting with 0.1 M CAPS buffer pH 11.8, 60 mM BME, 37 °C, 2 h. The 
progress of each reaction and purijBcation was followed by denaturing polyacrylamide gel 
electrophoresis (bottom). Lanes 3, 6, and 9: control reactions using reagents containing 
scrambled oligonucleotide sequences. 

[0060] Figure 40 depicts Nourpeptidic DNA-templated multi-step synthesis. The reagent 
linkers used in steps 1, 2, and 3 were the diol linker, autocleaving Wittig linker, and sulfone 
linker, respectively; see Figure 1 for linker cleavage conditions. Conditions: 17 to 18: activate 
two equivalents 17 in 20 mM EDC, 15 mM sulfo-NHS, 0.1 M MES buffer pH 5.5, 1 M NaCl, 10 
min, 25 °C, then add to template in 0.1 M MOPS pH 7.5, IM NaCl, 16°C, 8 h; 19 to 21: tiiree 
equivalents 20, 0.1 M TAPS pH 9.0, 3 M NaCl, 48 h, 25 °C; 22 to 23: three equivalents 22, 0.1 
M TAPS pH 8.5, 1 M NaCl, 21 h, 25°C. The progress of each reaction and purification was 
followed by denaturing polyacrylamide gel electrophoresis (bottom). Lanes 3, 6, and 9: control 
reactions using reagents containing scrambled oligonucleotide sequences. 

[0061] Figure 41 depicts the use of nucleic acids to direct the synthesis of new polymers and 
plastics by attaching the nucleic acid to the ligand of a polymerization catalyst. The nucleic acid 
can fold into a complex structure which can affect the selectivity and activity of the catalyst. 
[0062] Figure 42 depicts the use of Grubbs' ring-opening metathesis polymerization 
catalysis in evolving plastics. The S3mthetic scheme of a dihydroimidazole ligand attached to 
DNA is shown as well as the monomer to be used in the polymerization reaction. 
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[0063] Figure 43 depicts the evolutioii of plastics through iterative cycles of ligand 
"divereificatibn, seiecfion and"amplificatiorrto creafepoljnmers witirdesired properties: 
[0064] Figure 44 depicts an exemplary scheme for the synthesis, in vitro selection and 
amplification of a library of compounds. 

[0065] Figure 45 depicts exemplary templates for use in recombination. 

[0066] Figure 46 depicts several exemplary deox3nibunucleotides and ribonucleotides 

bearing modifications to groups that do not participate in Watson-Crick hydrogen bonding and 

are known to be inserted with high sequence fidelity opposite natural DNA templates. 

[0067] Figure 47 depicts exemplary metal binding uridine and 7-deazaadenosine analogs. 

[0068] Figure 48 depicts the synthesis of analog (7). 

[0069] Figure 49 depicts the synthesis of analog (30). 

[0070] Figure 50 depicts the synthesis of 8-modified deoxyadenosine triphosphates. 

[0071] * Figure 51 depicts the results of an assay evaluating the acceptance of modified 

nuceotides by DNA polymerases. 

[0072] Figure 52 depicts the synthesis of 7-deazaadenosine derivatives. 
[0073] Figure 53 depicts certain exemplary nucleotide triphosphates. 

[0074] Figure 54 depicts a general method for the generation of libraries of metal-binding 
polymers. 

[0075] Figures 55 and 56 depict exemplary schemes for the in vitro selections for non- 
natural polymer catalysts^ 

[0076] Figure 57 depicts an exemplary scheme for the in vitro selection of catalysts for Heck 
reactions, hetero Diels-Alder reactions and aldol additions. 

Description of Certain Embodiments of the Invention 
[0077] As discussed above, it would be desirable to be able to evolve and amplify chemical 
compoimds including, but not limited to small molecules and polymers, in the same way that 
biopolymers such as polynucleotides and proteins can be amplified and evolved. It has been 
demonstrated that DNA-templated synthesis provides a possible means of translating the 
information in a sequence of DNA into a synthetic small molecule. In general, DNA templates 
linked to one reactant may be able to recruit a second reactive group linked to a complementary 
DNA molecule to yield a product. Since DNA hybridization is sequence-specij5c, the result of a 
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DNA-templated reaction is the translation of a specific DNA sequence into a corresponding 
reactionrprdductr"As^ 

catalyze the sequence-specific oligomerization of complementary oligonucleotides (T. Inoue ei 
al J. Am. Chem. Soc. 1981, 103, 7666; T. Inou et al J. Mol Biol 1984, 178, 669-76) has been 
demonstrated- This discovery was soon followed by fiiidings that DNA or KNA templates could 
catalyze the oligomerization of complementary DNA or RNA mono-, di-, tri-, or 
oligonucleotides (T. Inoue et ah J. Am. Chem. Soc. 1981, 705, 7666; L. E. Orgel et al Aca 
Chem. Res. 1995, 28, 109-1 18; H. Rembold et al J, Mol Evol 1994, 38, 205; L. Rodriguez et al 
J. Mol Evol 1991, 55, 477; C. B. Chen et al J, Mol Biol 1985, 181, 21V). DNA or RNA 
templates have since been shown to accelerate the formation of a variety of non-natural nucleic 
acid analogs, including peptide nucleic acids (C. Bohler et al Natwe 1995, 376, 578), 
phosphorothioate- (M. K. Herrlein et al X Am, Chem, Soc. 1995, 117, 10151-10152), 
phosphoroselenate- (Y. Xu et al J. Am. Chem. Soc. 2000, 122, 9040-9041; Y. Xu et al Nat. 
Biotechnol 2001, 19, 148-152) and phosphoramidate-(A. Luther et al Natiwe 1998, 396, 245-8) 
containing nucleic acids, non-ribose nucleic acids (M. Bolli et al Chem. Biol 1997, 4, 309-20), 
and DNA analogs in which a phosphate Unkage has been replaced with an aminoethyl group (Y. 
Gat et al Biopolymers 1998, 48, 19-28). Nucleic acid templates can also catalyze amine 
acylation between nucleotide analogs (R. K. Bruick et al Chem. Biol 1996, 5, 49-56). 
[0078] However, although the ability of nucleic acid templates to accelerate the foimation of 
a variety of non-natural nucleic acid analogues has been demonstrated, nearly all of these 
reactions previously shown to be catalyzed by nucleic acid templates were designed to proceed 
through transition states closely resembling the structure of the natural nucleic acid backbone 
(Fig. 2), typically affording products that preserve the same six-bond backbone spacing between 
nucleotide units. The motivation behind this design was presumably the assumption that the rate 
enhancement provided by nucleic acid templates depends on a precise alignment of reactive 
groups, and the precision of this ahgnment is maximized when the reactants and products mimic 
the structm-e of the DNA and RNA backbones. Evidence in support of the hypothesis that DNA- 
templated synthesis can only generate products that resemble the nucleic acid backbone comes 
from the well-known difiSculty of macrocyclization in organic synthesis (G. Illuminati et al Acc. 
Chem. Res. 1981, 14, 95-102; R. B. Woodward etal J. Am. Chem. Soc 1981, 103, 3210-3213). 
The rate enhancement of intramolecular ring closing reactions compared with their 
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intennolecular counterparts is known to diminish quickly as rotatable bonds are added between 

reactive"gfOupsrsuch"that^ 

rate acceleration (G. Illuminati et al Acc. Chem. Res. 1981, 14^ 95-102). 

[0079] Because synthetic molecules of interest do not in general resemble nucleic acid 
backbones, the use of DNA-templated synthesis to translate DNA sequences into synthetic small 
molecules would be broadly useful only if synthetic molecules other than nucleic acids and 
nucleic acid analogs coxild be synthesized in a DNA-templated fashion. The ability of DNA- 
templated synthesis to translate DNA sequences into arbitrary non-natural small molecules 
therefore requires demonstrating that DNA-templated synthesis is a much more general 
phenomenon than has been previously described. 

[0080] Signficantly, for the first time it has been demonstrated herein that DNA-templated 
synthesis is indeed a general phenomenon and can be used for a variety of reactions and 
conditions to generate a diverse range of compoimds, specifically including for the first time, 
compounds that are not, or do not resemble, nucleic acids or analogs thereof. More specifically, 
the present invention extends the ability to amplify and evolve libraries of chemical compoxmds 
beyond natural biopolymers. The ability to synthesize chemical compounds of arbitrary 
structure allows researchers to write then: own genetic codes incorporating a wide range of 
chemical functionality into novel backbone and side-chain structures, which enables the 
development of novel catalysts, dmgs, and polymers, to name a few examples. For example, the 
ability to directly amplify and evolve these molecules by genetic selection enables the discovery 
of entirely new families of artificial catalysts which possess activity, bioavailability, solvent, or 
themial stabiUty, or other physical properties (such as fluorescence, spin-labeUng, or 
photolability) which are difiBcult or impossible to achieve using the limited set of natural protein 
and nucleic acid building blocks. Similarly, developing methods to amplify- and directly evolve 
synthetic small molecules by iterated cycles of mutation and selection enables the isolation of 
novel ligands or drugs with properties superior to those isolated by traditional rational design or 
combinatorial screening drug discovery methods. Additionally, extending the approaches 
described herein to polymers of significance in material science would enable the evolution of 
new plastics. 

[0081] In general, the method of the invention involves 1) providing one or more nucleic 
acid templates, which one or more nucleic acid templates optionally have a reactive unit 
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associated therewith; and 2) contacting the one or noiore nucleic acid templates with one or more 
transfer units designed to have a first moiety, an anti-codon which hybridizes to a sequence of 
the nucleic acid, and is associated yvith a second moiety, a reactive unit, which includes specific 
fimctionality, a building block, reactant, etc. for the compound to be synthesized. It will be 
appreciated that in certain embodiments of the invention, the transfer unit comprises one moiety 
incorporating the hybridization capablility of the anti-codon unit and the chemical fimctionality 
of the reaction unit. Once these transfer units have hybridized to the nucleic acid template in a 
sequence-specific manner, the synthesis of the chemical compound can take place due to the 
mteraction of reactive imits present on the transfer units and/or the nucleic acid template. 
Significantly, the sequence of the nucleic acid can later be determined to decode the synthetic 
history of the attached compound and thereby its structure. It will be appreciated that the method 
described herein may be used to synthesize one molecule at a time or may be used to synthesize 
thousands to millions of compounds using combinatorial methods. 

[0082] It will be appreciated that a variety of chemical compounds can be prepared and 
evolved according to the method of the invention. In certain embodiments of the invention, 
however, the methods are utilized for the synthesis of chemical compounds that are not, or do 
not, resemble nucleic acids or nucleic acid analogs. For example, in certain embodiments of the 
invention, small molecule compounds can be syntheiszed by providing a template which has a 
reactive unit (e.g., building block or small molecule scaffold) associated therewith (attached 
directly or througli a linker as described in more detail in Examples 5 herein), and contacting the 
template simultaneously or sequentially with one or more transfer tmits having one or more 
reactive units associated therewith. In certain other embodiments, non-natural polymers can be 
synthesized by providing a template and contacting the template simultaneously with one or 
more transfer units having one or more reactive imits associated therewith xmder conditions 
suitable to effect reaction of the adjacent reactive units on each of the transfer units (see, for 
example. Figure 3, and examples 5 and 9, as described in more detail herein). 
[0083] Certain embodiments are discussed in more detail below; however, it will be 
appreciated that the present invention is not intended to be limited to those embodiments 
discussed below. Rather, the present invention is intended to encompass these embodiments and 
equivalents thereof. 
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Teinplates 

[0084] As discussed above, one or more templates are utilized in the method of the invention 
and hybridize to the transfer units to direct the synthesis of the chemical compoimd. As would 
be appreciated by one of skill in this art, any template may be used in the methods and 
compositions of the present invention. Templates which can be mutated and thereby evolved can 
be used to guide the synthesis of another chemical compound or library of chemical compounds 
as described in the present invention. As described in more detail herein, the evolvable template 
encodes the synthesis of a chemical compound and can be used later to decode the syntiietic 
history of the chemical compound, to indirectly amplify the chemical compound, and/or to 
evolve (z-e., diversify, select, and amplify) the chemical compound. The evolvable template is, 
in certain embodiments, a nucleic acid. In certain embodiment of the present invention, the 
template is based on a nucleic acid. ; ; 

[0085] The nucleic acid templates used in the present invention are made of DNA, RNA, a 
hybrid of DNA and KNA, or a derivative of DNA and RNA, and may be smgle- or double- 
stranded. The sequence of the template is used in the inventive method to encode the synthesis 
of a chemical compoimd, preferably a compoimd that is not, or does not resemble, a nucleic acid 
or nucleic acid analog (e.g., an unnatural polymer or a smsill molecule). In the case of certain 
ujonatural polymers, the nucleic acid template is used to align the monomer units in the sequence 
tibey will appear in the polymer and to bring them in close proximity with adjacent monomer 
units along the template so that tiiey will react and become joined by a covalent bond. In the 
case of a small molecule, the template is used to bring particular reactants within proximity of 
the small molecule scaffold in order that they may modify the scaffold in a particular way. In 
certain other embodiments, the template can be utilized to generate non-natural polymers by 
PGR amplification of a synthetic DNA template library consisting of a random region of 
nucleotides, as describe in Example 9 herein. 

[0086] As would be appreciated by one of skill in the art, the sequence of the template may 
be designed in a number of ways without going beyond the scope of the present invention. For 
example, the length of the codon must be determiaed and the codon sequences must be set If a 
codon length of two is used, then Using the four naturally occurring bases only 16 possible 
combinations are available to be used in encoding the library. If the length of the codon is 
increased to three (the number Nature uses in encoding proteins), the number of possible 



17 of 88 



wo 02/074929 




PCT/US02/08546 



combinatioBS is increased to 64. Other factors to be considered in determining the length of the 
codon are mismatching, frame-shifting, complexity of library, etc. As the length of the codon is 
increased up to a certain extent the number of mismatches is decreased; however, excessively 
long codons will hybridize despite mismatched base pairs. In certain embodiments of special 
interest, the length of the codon ranges between 2 and 10 bases. 

[0087] Another problem associated with using a nucleic acid template is frame shifting. In 
Nature, the problem of frame-shifting in the translation of protein from an mRNA is avoided by 
use of the complex machinery of the ribosome. The inventive methods, however, will not take 
advantage of such complex machinery. Instead, frameshifting may be remedied by lengthening 
each codon such that hybridization of a codon out of frame will guarantee a mismatch. For 
example, each codon may start with a G, and subsequent positions may be restricted to T, C, and 
A (Figure 4). In another example, each codon may begin and end with a G, and subsequent 
positions may be restricted to T, C, and A. Another way of avoiding frame shifting is to have the 
codons sufficiently long so that the sequence of the codon is only found within the sequence of 
the template ''in frame". Spacer sequences may also be placed in between the codons to prevent 
frame shifting. 

[0088] It will be appreciated that the template can vary greatly iii the nimber of bases. For 
example, m certain embodiments, the template may be 10 to 10,000 bases long, preferably 
between 10 and 1,000 bases long. The length of the template will of course depend on the length 
of the codons, complexity of the library, length of the unnatural polymer to be synthesized, 
complexity of the small molecule to be synthesized, use of space sequences, etc. The nucldc 
acid sequence may be prepared using any method known in the art to prepare nucleic acid 
sequences. These methods include both in vivo and in vitro methods including PGR, plasmid 
preparation, endonuclease digestion, solid phase synthesis, in vitro transcription, strand 
separation, etc. In certain embodiments, the nucleic acid template is synthesized using an 
automated DNA sjoithesizer. 

[0089] As discussed above, in certain embodiments of the invention, the method is used to 
synthesize chemical compounds that are not, or do not resemble, nucleic acids or nucleic acid 
analogs. Although it has been demonstrated that DNA-templated synthesis can be utilized to 
direct the synthesis of nucleic acids and analogs thereof, it has not been previously demonstrated 
that the phenomenon of DNA-tempalted synthesis is general enough to extend to other more 
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complex chemical compomids (e.g., small molecules, non-natural polymers). As described in 
detail herein, it has been demonstrated that DNA-templated synthesis is indeed a more general 
phenomenon and that a variety of reactions can be utilized. 

[0090] Thus, in certain embodiments of the present invention, the nucleic acid template 
comprises sequences of bases that encode the synthesis of an imnatural polymer or small 
molecule. The message encoded in tiie nucleic acid template preferably begins with a specific 
codon that bring into place a chemically reactive site firom which the polymerization can take 
place, or in the case of synthesizing a small molecide Ihe "starf ' codon may encode for an anti- 
codon associated with a small molecule scaffold or a first reactant. The "start" codon of the 
, present invention is analogous to the "start" codon, ATG, foxmd in Nature, which encodes for the 
amino acid methionine. To give but one example for use in synthesizing an unnatural polymer 
library, the start codon may encode for a start monomer unit comprising a primary amine masked 
by a photolabile protecting group, as shown below in Example 5A. 

[0091] In yet other embodiments of the invention, the nucleic acid template itself may be 
modified to include an initiation site for polymer synthesis (e.g., a nucleophile) or a small 
molecule scaffold. In certain embodiments, the nucleic acid template includes a hairpin loop on 
one of its ends ter minat ing in a reactive group used to initiate polymerization of the monomer 
imits. For example, a DNA template may comprise a hairpin loop terminating in a 5 '-amino 
group, which may be protected or not. From the amino group polymerization of the xmnatural 
polymer may coromence. The reactive amino group can also be used to link a small molecule 
scaffold onto the nucleic acid template in order to synthesize a small molecule library. 
[0092] To terminate the synthesis of the unnatural polymer a "stop" codon should be 
included in the nucleic acid template preferably at the end of the encoding sequence. The "stop" 
codon of the present invention is analogous to the "stop" codons (i.e., TAA, TAG, TGA) found 
in mRNA transcripts. In Nature, these codons lead to .the termination of protein synthesis. In 
certain embodiments, a "stop" codon is chosen that is compatible with the artificial genetic code 
used to encode the unnatural polymer. For example, the "stop" codon should not conflict with 
any other codons used to encode the synthesis, and it should be of the same general format as the 
other codons used in the template. The "stop" codon may encode for a monomer unit that 
terminates poljnnerization by not providing a reactive group for further attachment. For 
example, a stop monomer unit may contain a blocked reactive group such as an acetamide rather 
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than a primary amine as shown in Example 5A below. In yet other embodiments, the stop 
monomer unit comprises a biotinylated terminus providing a convenient way of terminating the 
polymerization step and purifying the resulting polymer. . 

Transfer Units 

[0093] As described above, in the method of the invention, transfer units are also provided 
which comprise an anti-codon and a reactive unit It will be appreciated that the anti-codons 
used in the present invention are designed to be coihplementary to the codons present within the 
nucleic acid template, and should be designed with the nucleic acid template and the codons used 
therein in mind. For example, the sequences used in the template as well as the length of the 
codons would need to be taken into account in designing the anti-codons. Any molecule which 
is complementary to a codoh used in the template may be used in the inventive methods (e.g., 
nucleotides or non-natural nucleotides). In certEun embodiments, the codons comprise one or 
more bases found in nature (z.e., thymidine, uracil, guanidine, c3^osine, adenine). In certain 
other embodiments, the anti-codon comprises one or more nucleotides normally found in Nature 
with a base, a sugar, and an optional phosphate group. In yet other embodiments, the bases are 
strung out along a backbone that is not the sugar-phosphate backbone normally foimd in Nature 
(e.g., non-natural nucleotides) . 

[0094] As discussed above, the anti-codon is associated with a particular type of reactive imit 
to form a transfer unit. It will be appreciated tiiat this reactive unit may represent a distinct entity 
or may be part of the functionality of the anti-codon unit (see. Example 9). In certain 
embodiments, each anti-codon sequence is associated with one monomer type. For example, the 
anti-codon sequence ATTAG may be associated with a carbamate residue with an iso-butyl side 
chain, and the anti-codon sequence CATAG may be associated with a carbamate residue with a 
phenyl side chain. This one-for-one mapping of anti-codon to monomer units allows one to 
decode any polymer of the library by sequencing the nucleic acid template used in the synthesis 
and gdlows one to synthesize the same polymer or a related polymer by knowing the sequence of 
the original polymer. It will be appreciated by one of skill in this art that by changing (e.g., 
mutating) the sequence of the template, different monomer units will be brought into place, 
thereby allowing the synthesis of related, polymers, which can subsequently be selected and 
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evolved. In certain preferred embodiments, several anti-codons may code for one monomer tmit 
as is the case in Nature. 

[0095] .In certain other embodiments of the present invention where a small molecule library 
is to be created rather than a polymer library, the anti-codon is associated with a reactant used to 
modify the small molecule scaffold. In certain embodiments, the reactant is associated with the 
anti-codon through a linker long enough to allow the reactant to come in contact with the small 
molecule scaffold. The linker should preferably be of such a length and composition to allow for 
intramolecular reactions and minimize intermolecular reactions. The reactants include a variety 
of reagents as demonstrated by the wide range of reactions that can be utilized in DNA- 
templated synthesis (see Example 2, 3 and 4 herein) and can be any chemical group, catalyst 
(e.g,y organometallic compounds), or reactive moiety (e.g., electrophiles, nucleophiles) known in 
the chemical arts. 

[0096] Additionally, the association between the anti-codon and the monomer unit or 
reactant in the transfer unit may be covalent or non-covalent. In certain embodiments of special 
intereste, the association is through a covalent bond, and in certain embodiments the covalent 
linkage is severable. The linkage may be cleaved by light, oxidation, hydrolysis, exposure to 
acid, exposure to base, reduction, etc. For examples of linkages used in this art, please see 
Fruchtel et al Angew. Chem. Int. Ed Engl 35:17, 1996, incorporated herein by reference. The 
anti-codon and the monomer unit or reactant may also be associated through non-covalent 
interactions such as ionic, electrostatic, hydrogen bonding, van der Waals interactions, 
hydrophobic interactions, pi-stacking, etc. and combinations thereof To give but one example, 
the anti-codon may be linked to biotin, and the monomer unit linked to streptavidin. The 
propensity of streptavidin to bind biotin leads to the non-covalent association between the anti- 
codon and the monomer unit to form the transfer unit. 

Synthesis of Certain Exemplary Compounds 

[0097] It will be appreciated that a variety of compounds and/or libraries can be prepared 
using the method of the invention. As discussed above, in certain embodiments of special 
interest, compoxmds that are not, or do not resemble, nucleic acids or analogs thereof, are 
synthesized according to the method of the invention. 
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[0098] In certain embodiments, polymers, specifically unnatural polymers, are prepared 
according to the method of the present invention. The uimatural polymers that can be created 
using the inventive method and system include any unnatural polymers. Exemplary unnatural 
polymers include, but are not limited to, polycarbamates, polyureas, polyesters, polyacrylate, 
polyalkylene polyethylene, polypropylene), polycarbonates, polypeptides with unnatural 
stereochemistry,. polypeptides with unnatural amino acids, and combination tiiereof- In certain 
embodiments, the polymers comprises at least 10 monomer units. In certain other embodiments, 
the polymers comprise at least 50 monomer units. In yet other embodiments, the polymers 
comprise at least 100 monomer units. The polymers synthesized using the inventive system may 
be used as catalysts, pharmaceuticals, metal chelators, materials, etc, 

[0099] In preparing certain imnatural polymers, the monomer imits attached to the anti- 
codons and used in the present invention may be any monomers or oligomers capable of being 
joined together to form a poljaner. The monomer imits may be carbamates, D-amino acids, 
imnatural amino acids, ureas, hydroxy acids, esters, carbonates, acrylates, ethers, etc. In certain 
embodiments, the nionomer units have two reactive groups used to link the monomer unit into 
the growing polymer chain. Preferably, the two reactive groups are not the same so that the 
monomer unit may be incorporated into the polymer in a directional sense, for example, at one 
end may be an electrophile and at the other end a nucleophile. Reactive groups may include, but 
are not limited to, esters, amides, carboxylic acids, activated carbonyl groups, acid chlorides, 
amines, hydroxyl groups, thiols, etc. In certain embodiments, the reactive groups are masked or 
protected (Greene & Wuts Protective Groups in Organic Synthesis, 3rd Edition Wiley, 1999; 
incorporated herein by reference) so that polymerization may not take place until a desired time 
when the reactive groups are deprotected. Once the monomer imits are assembled along the 
nucleic acid template, initiation of the polymerization sequence results in a cascade of 
polymerization and deprotection steps wherein the polymerization step results in deprotection of 
a reactive group to be used in the subsequent polymerization step (see. Figure 3). 
[00100] The monomer units to be polymerized may comprise two or more units depending on 
the geometry along the nucleic acid template. As would be appreciated by one of skill in this art, 
the monomer units to be polymerized mxist be able to stretch along the nucleic acid template and 
particularly across the distance spanned by its encoding anti-codon and optional spacer sequence. 
In certain embodiments, the monomer unit actually comprises two monomers, for example, a 
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dicarbamate, a dinrea, a dipeptide, etc. In yet other embodiments, liie monomer unit actually 
comprises three or more monomers. 

[00101] The monomer xmits may contain any chemical groups known in the art As would be 
appreciated by one of skill in this art, reactive chemical groups especially those that would 
interfere with polymerization, hybridization, etc. are masked using known protecting groups 
(Greene & Wuts Protective Groins in Organic Synthesis, 3rd Edition Wiley, 1999; incorporated 
herein by reference). In general, the protecting groups used to mask these reactive groups are 
orthogonal to those used in protecting the groups used in the polymerization steps. 
[00102] In synthesizing an uimatural polymer, in certain embodiments, a template is provided 
encoding the sequence of monomer imits. Transfer units are tlien allow to contact the template 
under conditions that allow for hybridization of the anti-codons to the template. Polymerization 
of the monomer units along the template is then allowed to occur to form the unnatural polymer. 
The newly synthesized polymer may then be cleaved j&om the anti-codons and/or the template. 
The template may be used as a tag to elucidate the structure of the polymer or may be used to 
amplify and evolve the unnatural polymer. As will be described in more detail below, the 
present method may be used to prepare a library of unnatural polymers. For example, in certain 
embodiments, as described in more detail in Example 9 herein, a library of DNA templates can 
be utilized to prepare unnatxiral polymers. In general, the method takes advantage of tihte fact that 
certain DNA polymerases are able to accept certain modified nucleotide triphosphate substrates 
and that several deoxyribonucleotides and ribonucleotides bearing modified groups that do not 
participate in Watson-Crick bonding are known to be inserted with high sequence specificity 
opposite natural DNA templates. Accordingly, single stranded DNA containing modified 
nucleotides can serve as efiScient templates for the DNA-poIymerase catalyzed incorporation of 
natural or modified nucleotides. 

[00103] It will be appreciated that the inventive methods may also be ixsed to synthesize other 
classes of chemical compounds besides unnatural polymers. For example, small molecules may 
be prepared using the methods and compositions provided by the present invention. These small 
molecules may be natural product-like, non-polymeric, and/or non-oligomeric. The substantial 
interest in small molecules is due in part to their use as the active ingredient in many 
pharmaceutical preparations although they may also be used as catalysts, materials, additives, 
etc. 
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[00104] In synthesiziiig small molecules using the method of the present invention, an 
evoivabie template is also provided. The template may either comprise a small molecule 
scaffold upon which the small molecule is to be built^ or a small molecule scaffold may be added 
to the template. The small molecule scaffold may be any chemical compound with sites for 
functionalization. For example^ the small molecule scaffold may comprises a ring system (e.g., 
the ABCD steroid ring system found in cholesterol) with functionalizable groups off the atoms 
making up the rings. In another example, the small molecxde may be the underlying structure of 
a pharmaceutical agent such as morphine or a cephalosporin antibiotic (see Examples 5C and 5D 
below below). The sites or groups to be functionalized on the small molecule scaffold may be 
protected using methods and protecting groups known in the art. The protecting groups used in a 
small molecule scaffold may be orthogonal to one another so that protecting groups can be 
removed one at a time. 

[00105] In this embodiment, the transfer units comprise an anti-codon similar to those 
described in the uimatural polymer synthesis; however, these anti-codons are associated with 
reactants or building blocks to be iised in modifying, adding to, or taking away from the- small 
molecule scaffold. The reactants or building blocks may be electrophiles (e.g.y acetyl, amides, 
acid chlorides, esters, nitriles, imines), nucleophiles {e.g.^ amines, hydroxyl groups, thiols), 
catalysts (e.g., organometallic catalysts), side chains, etc. See, for example reactions in aqueous 
and organic media as described herein in Examples 2 and 4. The ti;ansfer units are allowed to 
contact the template under hydridizing conditions, and the attached reactant or building block is 
allowed to react with a site on the small molecule scaffold. In certain embodiments, protecting 
groups on the small molecule template are removed one at a time from the sites to be 
functionalized so that the reactant of the transfer unit will react at only the desired position on the 
scaffold. As will be appreciated by one of skill in the art, the anti-codon may be associated with 
the reactant through a linker moiety (see. Example 3). Tlie linker facilitates contact of the 
reactant with the small molecule scaffold and in certain embodiments, depending on the desired 
reaction, positions DNA as a leaving group ("autocleavable" strategy), or may link reactive 
groups to the template via the "scarless" linker strategy (which yields product without leaving 
behind additional chemical functionality), or a "useful scar" strategy (in which the linker is left 
behind and can be functionalized in subsequent steps following linker cleavage). The reaction 
condition, linker, reactant, and site to be functionalized are chosen to avoid intermolecular 
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reactions and accelerate intraraolecular reactions. It will also be appreciated that the method of 
the present invention cbhteinplates both sequential and simuiraneoUs contacting of ihe cempiate 
with transfer units depending on the particular compound to be synthesized. In certain 
embodiments of special interest, the multi-step synthesis of chemical compounds is provided in 
which the template is contacted sequentially with two or more transfer units to facilitate multi- 
step synthesis of complex chemical compounds. 

[00106] After the sites on the scaffold have been modified, the newly synthesized small 
molecule is linked to the template that encoded is synthesis. Decoding of the template tag will 
allow one to elucidate the synthetic history and thereby the structure of the small molecule. The 
template may also be amplified in order to create more of the desired small molecule and/or the 
template may be evolved to create related small molecules. The small molecule may also be 
cleaved fi*om the template for purification or screening. 

[00107] As would be appreciated by one of skill in tliis art, a plurality of templates may be 
used to encode the synthesis of a combinatorial library of small moleciiles using the method 
described above. This would allow for the amplification and evolution of a small molecule 
library, a feat which has not been accomplished before the present invention. 

Method of Synthesizing Libraries of Compounds 

[00108] In the inventive method, a nucleic acid template, as described above, is provided to 
direct the synthesis of an unnatural polymer, a small molecule, or any other type of molecule of 
interest. In general, a plurality of nucleic acid templates is provided wherein the number of 
different sequences provided ranges firom 2 to 10^^. In one embodiment of the present invention, 
a pluraUty of nucleic acid templates is provided, preferably at least 100 different nucleic acid 
templates, more preferably at least 10000 different nucleic acid templates, and most preferably at 
least 1000000 different nucleic acid templates. Each template provided comprises a unique 
nucleic acid sequence used to encode the synthesis of a particular unnatural polymer or small 
molecule. As described above, the template may also have functionality such as a primary amine 
from which the polymerization is initiated or a small molecule scaffold. In certain embodiments, 
the nucleic acid templates are provided in one "pot". In certain other embodiments, the 
templates are provided in aqueous media, and subsequent reactions are perfonned in aqueous 
media. 
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[00109] To the template is added transfer miits with anti-codons, as described above, 
associated with a monomer unit, as described abovei. In certain enibodiments, a plurality of 
transfer units is provided so that there is an anti-codon for every codon represented in the 
template. In a preferred embodiment, certain anti-codons are used as start and stop sites. In 
general, a large enough number of transfer units is provided so that all corresponding codon sites 
on the template are filled after hybridization. 

[00110] The anti-codons of the transfer units are allowed to hybridize to the nucleic acid 
template thereby bringing the monomer units together in a specific sequence as determined by 
the template. In the situation where a small molecule library is being synthesized, reactants are 
brought in proximity to a small molecule scaffold. The hybridization conditions, as would be 
appreciated by those of skill in the art, should preferably allow for only perfect matching 
between the codon and its anti-codon. Even single base pair mismatches should be avoided. 
Hybridization conditions may include, but are not limited to, temperature, salt concentration, pH, 
concentration of template, concentration of anti-codons, and solvent. The hybridization 
conditions used in synthesizing the library may depend on the length of the codon/anti-codon, 
the similarity between the codons present in the templates, the content of G/C versus A/T base 
pairs, etc (for further information regarding hybridization conditions, please see. Molecular 
Cloning: A Laboratory Manual, 2nd Ed., ed. by Sambrook, Fritsch, and Maniatis (Cold Spring 
Harbor Laboratory Press: 1989); Nucleic Acid Hybridization (B. D. Hames & S. J. Higgins eds. 
1984); the treatise. Methods in Enzynwlogy (Academic Press, Inc., N.Y.); Ausubel et al Current 
Protocols in Molecular Biology (John Wiley & Sons, Inc., New York, 1999); each of which is 
incorporated herein by reference). 

[00111] After hybridization of the anti-codons to the codons on the template have occurred, 
the monomer units are then polymerized in the case of the synthesis of unnatural polymers. The 
polymerization of the monomer units may occur spontaneously or may need to be initiated, for 
example, by the deprotection of a reactive groups such as a nucleophile or by providing light of a 
certain wavelength. In certara other embodiments, polymers can be. catalyzed by DNA 
polymerization capable of effecting polymerization of non-natural nucleotides (see. Example 9). 
The polymerization preferably occurs in one direction along the template with adjacent monomer 
imits becoming joined through a covalent linkage. The termination of the polymerization step 
occurs by the addition of a monomer unit that is not capable of being added onto. In the case of 
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the synthesis of small molecules, the reactants are allowed to react with the small molecule 
scafibld. The reactant may react q)ontaiieously3 oir protecting groups on the reactant and/or the 
small molecule scaJSbld may need to be removed. Other reagents (e.g.y acid, base, catalyst, 
hydrogen gas, etc.) may also be needed to ejffect the reaction (see. Examples 5A-5E). 
[00112] After the unnatural polymers or small molecules have been created with the aid of the 
nucleic acid template, they may be cleaved from the nucleic acid template and/or anti-codons 
used to synthesize them. In certain embodiments, the polymers or small molecules are assayed 
before being completely detached from the nucleic acid templates that encode them. Once the 
polymer or small molecule is selected, the sequence of the template or its complement may be 
determined to elucidate the structure of the attached polymer or small molecule. This sequence 
may then be amplified and/or evolved to create new libraries of related polymers or small 
molecules that in tum may be screened and evolved. 

Uses 

[00113] The methods and compositions of the present invention represent a new way to 
generate molecules with desired properties. This approach marries the extremely powerfid 
genetic methods, which molecular biologists have taken advantage of for decades, with the 
flexibility and power of organic chemistry. The ability to prepare, amplify, and evolve unnatural 
polymers by genetic selection may lead to new classes of catalysts that possess activity, 
bioavailability, stability, fluorescence, photolability, or other properties that are difiBcult or 
impossible to achieve xising the limited set of building blocks found in proteins and nucleic acids. 
Similarly, developing new systems for preparing, amplifying, and evolving small molecules by 
iterated cycles of mutation and selection may lead to the isolation of novel ligands or drags with 
properties superior to those isolated by slower traditional drug discovery methods (see. Example 
7). 

[00114] Performing organic library synthesis on the molecular biology scale is a 
fundamentally different approach from traditional solid phase library synthesis and carries 
significant advantages. A library created using the inventive methods can be screened using any 
method known in this art (e.g., binding assay, catalytic assay). For example, selection based on 
binding to a target molecule can be carried out on the entire library by passing the library over a 
resin covalently linked to the target. Those biopolymers tliat have affinity for the resin-bound 
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target can be eluted with free target molecules, and the selected compounds can be amplified 
using the methods described above; Subsequent rounds of selection and amjpiincation can result 
in a pool of compoimds enriched with sequences that bind the target molecule. In certam 
embodiments, the target molecule mimics a transition state of a chemical reaction, and the 
chemical compounds selected may serve as a catalyst for the chemical reaction. Because the 
information encoding the synthesis of each molecule is covalently attached to the molecule at 
one end, an entire library can be screened at once and yet each molecule is selected on an 
individual basis. 

(00115] Such a library can also be evolved by introducing mutations at the DNA level using 
eiror-prone PGR (Cadwell et al PCR Methods Appl 2:28, 1992; incorporated herein by 
reference) or by subjecting the DNA to in vitro homologous recombination (Steimner Proc. Natl 
Acad, ScL USA 91:10747, 1994; Stemmer Nature 370:389, 1994; each of which is incorporated 
herein by reference). Repeated cycled of selection, amplification, and mutation may afford 
biopolymers with greatly increased binding afiBnity for target molecules or with significantly 
improved catalytic properties. The final pool of evolved biopolymers having the desired 
properties can be sequenced by sequencing the nucleic acid cleaved from the polymers. The 
nucleic acid-free polymers can be purified using any method known in the art including HPLC, 
column chromatography, FLPC, eta, and its binding or catalytic properties can be verified in the 
absence of covalently attached nucleic acid. 

[00116] The polymerization of synthetically-generated monomer imits independent of the 
ribosomai machinery allows the incorporation of an enormous variety of side chains with novel 
chemical, biophysical, or biological properties. Terminating each biopolymer with a biotin side 
chain, for example, allows the facile purification of only full-length biopolymers which have 
been completely translated by passing the library through an avidin-linked resin. Biotin- 
terminated biopoljoners can be selected for the actual catalysis of bond-breaking reactions by 
passing these biopolymers over resin linked through the substrate to avidia (Figure 5). Those 
biopolymers that catalyze substrate cleavage would self-elute from a column charged with this 
resin; Similarly, biotin-terminated biopolymers can be selected for the catalysis of bond-forming 
reactions (Figure 5). One substrate is linked to resin and the second substrate is linked to avidin. 
Biopolymers that catalyze bond formation between the substrates are selected by their ability to 
hgate the substrates together, resulting in attachment of the biopolymer to the resin. Novel side 
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chains can also be used to introduce cofactor into the biopolymers. A side chain containing a 
metal chelator, for example, may provide biopolymers vwith metal-mediated catalytic properties, 
while a flavin-containing side chain may equip biopolymers with the potential to catalyze redox 
reactions. 

[00117] In this maimer unnatural biopolymers may be isolated which serve as artificial 
receptors to selectively bind molecules or which catalyze chemical reactions. Characterization 
of these molecxiles would provide important insight into the ability of polycarbamates, polyureas, 
polyesters, polycarbonates, polypeptides with unnatural side chain and stereochemistries, or 
other imnatural polymers to form secondary or tertiary structures with binding or catalytic 
properties. 

Kits 

[00118] ' The present invention also provides kits and compositions for use in the inventive 
methods. The kits may contain any item or composition useful in practicing the present 
invention. The kits may include, but is not limited to, templates, anticodons, transfer tmits, 
monomer units, building blocks, reactants, small molecule scaffolds, buffers, solvents, enzymes 
(e.g., heat stable polymerase, reverse transcriptase, ligase, restriction endonuclease, exonuclease, 
Klenow fragment, polymerase, alkaline phosphatase, polynucleotide kinase), linkers, protecting 
groups, polynucleotides, nucleosides, nucleotides, salts, acids, bases, solid supports, or any 
combinations thereof. 

[00119] As would be appreciated by one of skill in this art, a kit for preparing unnatural 
polymers would contain items needed to prepare unnatural polymers using the inventive methods 
described herein. Such a kit may include templates, anti-codons, transfer units, monomers units, 
or combinations thereof. A kit for synthesizing small molecules may include templates, anti- 
codons, transfer units, building blocks, small molecule scaffolds, or combinations thereof, 
[00120] The inventive kit may also be equipped with items needed to amplify and/or evolve a 
polynucleotide template such as a heat stable polymerase for PGR, nucleotides, bufiFer, and 
primers. In certaia other embodiments, the inventive kit includes items commonly used in 
performing DNA shuffling such as polynucleotides, ligase, and nucleotides. 
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[00121] In addition to the templates and transfer iinits described herein, the present invention 

also includes compositions comprising complex small molecules, scaffolds, or unnatural polymer 

prepared by any one or more of the methods of the invention as described herein. 

« 

Equivalents 

The representative examples that follow are intended to help illustrate the invention, and 
are not intended to, nor should they be construed to, limit the scope of the invention. Indeed, 
various modifications of the invention and many further embodiments thereof, in addition to 
those shown and described herein, will become apparent to those skilled in the art from the full 
contents of this document, including the examples which follow and the references to the 
scientific and patent literature cited herein. It should further be appreciated that the contents of 
those cited references are incorporated herein by reference to help illustrate the state of the art^ 

The following examples contain important additional information, exemplification and 
guidance that can be adapted to the practice of this invention in its various embodiments and the 
equivalents thereof. 

Exemplification 

[00122] iExample 1; The GeneraMty of DNA-Templated Synthesis : Clearly, implementing 
the small molecule evolution approach described above requires establishing the generality of 
DNA-templated synthesis. The present invention, for the first time, establishes the generality fo 
this approach and thus enables the sjoitheis of a vareity of chemical compounds using DNA- 
templated synthesis. As shown in Figure 6a, the ability of two DNA architectures to support 
solution-phase DNA-templated synthesis was established. Both hairpin (H) and end-of-helix (E) 
templates bearing electrophilic maleimide groups reacted efficiently with one equivalent of thiol 
reagent linked to a complementary DNA oligonucleotide to yield the thioether product in 
Eoinutes at 25 ^C. DNA-templated reaction rates (^pp = -10^ IVT^s'^) were similar for H and E 
architectures despite significant differences in the relative orientation of their reactive groups. In 
contrast, no product was observed when using reagents containing sequence mismatches, or 
when using templates pre-quenched with excess P-mercaptoethanol (Fig. 6a). Both templates 
therefore support the sequence-specific DNA-templated addition of a thiol to a maleimide even 
though the structures of the resulting products differ markedly fi-om the structure of the natural 
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DNA backbone. Little or no non-templated intermolecular reaction products are observed under 
the reaction conditions (pH 7.5, 25 °C, 250 xnMNaCl, 60 nM template and reagent). 
[00123] Additionally, sequence-specific DNA-templated reactions spanning a variety of 
reaction types (Sn2 substitutions, additions to a,p-unsaturated carbonyl systems, and additions to 
vinyl sulfones), nucleophiles (thiols and amines), and reactant structures all proceeded in good 
yields with excellent sequence selectivity (Fig. 6b). Expected product masses were verified by 
mass spectrometry. In each case, matched but not mismatched reagents afforded product 
efficiently despite considerable variations in their transition state geometry, steric hindrance, and 
conformational flexibility. Collectively these findings indicate that DNA-templated synthesis is 
a general phenomenon capable of supporting a range of reaction t3rpes, and is not hmited to the 
creation of stmctures resembling nucleic acid backbones as described previously. 
[00124] Since sequence discrimination is important for the faithful translation of DNA into 
synthetic structures, tihe reaction rate of a matched reagent compared with that of a reagent 
bearing a single mismatched base near the center of its 10-base oligonucleotide was measured. 
At 25.°C, the initial rate of reaction of matched thiol reagents with iodoacetamide-linked H 
templates is 200-fold faster than that of reagents bearing a single mismatch (A^pp = 2.4 x 10"* M"^s" 
^ vs. 1.1 X 10^ M'^s"^ Fig. 7). In addition, small amounts of products arising from the annealing 
of mismatched reagents can be eliminated by elevating the reaction temperature beyond the Tm of 
the mismatched reagents (Fig. 7). The decrease in the rate of product formation as temperature 
is elevated further indicates that product formation proceeds by a DNA-templated mechanism 
rather than a simple intermolecular mechanism. 

[00125] In addition to reaction generality and sequence specificity!, DNA-templated synthesis 
also demonstrateis remarkable distance independence. Both H and E templates linked to 
maleimide or a-iodoacetamide groups promote sequence-specific reaction with matched, but not 
mismatched, thiol reagents amiealed anywhere on the templates examined thus far (up to 30 
bases away from the reactive group on the template). Reactants axmealed one base away react 
with similar rates as those annealed 2, 3, 4, 6, 8, 10, 15, 20, or 30 bases away (Fig. 8). In all 
cases, templated reaction rates are several hundred-fold higher than the rate of untemplated 
(mismatched) reaction (^app = lO'^-lO^ M"^s'^ vs. 5 x 10^ M"^s'^). At intervening distances of 30 
bases, products are efficiently formed presumably through transition states resembling 200- 
membered rings. These findings contrast sharply with the well-known difficulty of 
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jtnacrocyclization (see, for example, G. Illuminati et al Acc, Chem, Res. 1981, 14, 95-102; R. B. 
Woodward et al J. Am. Chem. Soc. 1981, 103, 3210-3213) in organic synthesis. 
[00126] To detennine the basis of the distance independence of DNA-templated synthesis, a 
series of modified E templates were first synthesized in which. the intervening bases were 
replaced by a series of DNA analogs designed to evaluate the possible contribution of (f) 
interbase interactions, (if) conformational preferences of the DNA backbone, (Hi) the charged 
phosphate backbone, and (fv) backbone hydrophilicity. Templates in which the intervening 
bases were replaced with any of the analogs in Fig. 9 had Uttle effect on the rates of product 
formation. These findings indicate that backbone structural elements specific to DNA are not 
responsible for the observed distance independence of DNA-templated synthesis. However, the 
addition of a 10-base DNA oUgonucleotide "clamp" complementary to the single-stranded 
intervening region significantly reduced product formation (Fig. 9), suggesting that the flexibility 
of this region is critical to efficient DNA-templated synthesis. 

[00127] The distance independent reaction rates may be explained if the bond-forming events 
in a DNA-templated format are sufiBciently accelerated relative to theh nontemplated 
counterparts such that DNA aimealing, rather than bond formation, is rate-deterinining. If DNA 
aimealing is at least partially rate limiting, then the rate of product formation should decrease as 
the concentration of reagents is lowered because annealing, unlike templated bond formation, is 
a bimolecular process. Decreasing the concentration of reactants in the case of the E template 
with one or ten intervening bases between reactive groups resulted in a marked decrease in the 
observed reaction rate (Fig. 10). This observation suggests that proximity effects in DNA- 
templated synthesis can enhance bond formation rates to the point that DNA aimealing becomes 
rate-detennining. 

[00128] These findings raise the possibility of using DNA-templated synthesis to translate in 
one pot Ubraries of DNA into solution-phase libraries of synthetic molecules suitable for PGR 
amplification and selection. The ability of DNA-templated synthesis to support a variety of 
transition state geometries suggests its potential in dh-ecting a range of powerful water- 
compatible synthetic reactions (see, Li, C.J. Organic Reactions in Aqueous Media, Wiley and 
Sons, New York: 1997). The sequence specificity described above suggests that mixtures of 
reagents may be able to react predictably with complementary mixtures of templates. Finally, 
the observed distance independence suggests that different regions of DNA "codons" may be 
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used to encode different groups on the same synthetic scaffold without impairing reactions rates. 
As a demonstration of this approach, a library of 1,025 maleimide-linked templates was 
synthdsized, each with a different DNA sequence in an ei^t-base encoding region (Fig. 11). 
One of these sequences, 5'-TGACGGGT-3', was arbitrarily chosen to code for the attachment of 
a biotin group to the template. A library of thiol reagents linked to 1,025 dififerent 
oligonucleotides was also generated. The reagent linked to 3'-ACTGCCCA-5' contained a 
biotin group, while the other 1,024 reagents contained no biotin. Equimolar ratios of all 1,025 
templates and 1,025 reagents were mixed in one pot for 10 minutes at 25 °C and the resulting 
products were selected in vitt*o for binding to streptavidin. Molecules surviving the selection 
were amplified by PGR and analyzed by restriction digestion and DNA sequencing. 
[00129] Digestion with the restriction endonuclease TspASl, which cleaves GTGAC and 
therefore cuts the biotin encoding template but none of the other templates, revealed a 1:1 ratio 
of biotin encoding to non-biotin encoding templates following selection (Fig. 12). This 
represents a 1,000-fold enrichment compared with the unselected library. DNA sequencing of 
the PGR amiplified pool before and after selection suggested a similar degree of enrichment and 
indicated that the biotin-encoding template is the major product after selection and amplification 
(Fig. 12). The ability of DNA-templated synthesis to support the simultaneous sequence-specific 
reaction of 1,025 reagents, each of which faces a 1,024:1 ratio of non-partner to partner 
templates, demonstrates its potential as a method to create synthetic libraries in one pot. The * 
above proof-of-principle translation, selection, and amplification of a synthetic library member 
having a specific property (avidin affinity in this example) addresses several key requirements 
for the evolution of non-natural small molecule libraries toward desired properties. 
[00130] Taken together, these results suggest that DNA-templated synthesis is a surprisingly 
general phenomenon capable of directing, rather than simply encoding, a range of chemical 
reactions to form products unrelated in stmcture to nucleic acid backbones. For several reactions 
examined, the DNA-templated format accelerates the rate of bond formation beyond the rate of a 
10-base DNA oUgonucleotide annealing to its complement, resulting in surprising distance 
independence. The facile nature of long-distance DNA-templated reactions may also arise in 
part from the tendency of water to contract the volume of nonpolar reactants (see, C.-J. Li et al. 
Organic Reactions in Aqueous Media, Wiley and Sons: New York, 1997) and from possible 
compactness of the intervening single^stranded DNA between reactive groups. These findings 
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may have implications for prebiotic evolution and for understanding the mechanisms of catalytic 
nucleic acids, which typically localize substrates to a strand of RNA or DNA. 
[00131] Methods: 

[00132] DNA synthesis. DNA oligonucleotides were synthesized on a PerSeptive 
Biosystems Expedite 8909 DNA synthesizer using standard protocols and purified by reverse 
phase HPLC. Oligonucleotides were quantitated spectrophotometrically and by denaturing 
polyacrylamide gel electrophoresis (PAGE) followed by staining with ethidium bromide or 
S YBR Green (Molecular Probes) and quantitation using a Stratagene Eagle Eye H densitometer. 
Phosphoramidites enabling the synthesis of 5'-NH2-dT, 5* tetrachlorofluorescein, abasic 
backbone spacer, C3 backbone spacer, 9-bond polyethylene glycol spacer, 12-bond saturated 
hydrocarbon spacer, and 5' biotin groups were purchased from Glen Research. Thiol-linked 
oligonucleotide reagents were synthesized on C3 disulfide controlled pore glass (Glen Research). 
[00133] Template functionalization. Templates bearing 5*-NH2-dT groups were transformed 
into a variety of electrophilic fimctional groups by reaction with tihie appropriate electrophile- 
NHS ester (Pierce). Reactions were performed in 200 mM sodium phosphate pH 7.2 with 2 
mg/mL electrophile-^fHS ester, 10% DMSO, and up to 100 |ig of 5'-amino template at 25 °C for 
1 h. Desired products were purified by reverse-phase HPLC and characterized by gel 
electrophoresis and MALDI mass spectrometry. 

[00134] DNA-tempIated synthesis reactions. Reactions were initiated by mixing equimolar 
quantities of reagent and template in buffer containing 50 mM MOPS pH 7.5 and 250 mM NaCl 
at the desired temperature (25 **C unless stated otiherwise). Concentrations of reagents and 
templates were 60 nM unless otherwise indicated. At various time points, aliquots were 
removed, quenched with excess p-mercaptoethanol, and analyzed by denaturing PAGE. 
Reaction products were quantitated by densitometry using their intrinsic fluorescence or by 
staining followed by densitometry. Representative products were also verified by MALDI mass 
spectrometry. 

[00135] In vitro selection for avidin binding. Products of the library translation reaction 
were isolated by ethanol precijpitation and dissolved in binding bxiffer (10 mM Tris pH 8, 1 M 
NaCl, 10 noM EDTA). Products were incubated with 30 fig of streptavidin-linked magnetic 
beads (Roche Biosciences) for 10 min at room temperature in 100 uL total volume. Beads were 
washed 16 times with binding buffer and eluted by treatment with 1 fxmol free biotin in 100 uL 
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binding buffer at 70 °C for 10 minutes. Eluted molecules were isolated by ethanol precipitation 
and amplified by standard PGR protocols (2 mM MgCh, 55 annealing, 20 cycles) using the 
primers 5'-TGGTGCGGAGCCGCCG and 5'-CCACTGTCCGTGGCGCGACCCCGGCTCC 
TCGGCTCGG. Automated DNA sequencing used the primer 5'- 

CCACTGTCCGTGGCGCGACCC. 

[00136] DNA Sequences, Sequences not provided in the figures are as follows: matched 
reagent in Fig. 6b SIAB and SBAP reactions: 5'-CCCGAGTCGAAGTCGTACC-SH; 
mismatched reagent in Fig. 6b SIAB and SBAP reactions: 5'-GGGCTCAGCTTCCCCATAA- 
SH; mismatched reagents for other reactions in Figs. 6b, 6c, 6d, and 8a; 5'-FAAATCTTCCC- 
SH (F= tetrachlorofluorescein); reagents in Figs. 6c and 6d containing one mismatch: 5 - 
FAATTCTTACC-SH; E templates in Figs. 6a, 6b SMCC, GMBS, BMPS, and SVSB reactions, 
and 8a: ' 5'-CNH2dT)- 

CGCGAGCGTACGCTCGCGATGGTACGAATTCGACTCGGGAATACCACCTTCGACTCG 
AGG; H template in Fig. 6b SIAB, SBAP, and SIA reactions: 5'-(NH2dT)- CGCGAGCGTACG 
CTCGCG ATGGTACGAATTC; clamp oligonucleotide in Fig 8b: 5'-ATTCGTACCA 

[00137] Example 2; Exemplary Reactions for Use in PNA-Templated Synthesis ; 
[00138] As discussed above, the generality of DNA-templated synthetic chemistry was 
examined (see, Liu et al J. Am, Chenu Soa 2001, 123, 6961). Specifically, the abiUty of DNA- 
templated sjmthesis to direct a modest collection of chemical reactions without requiring the 
precise alignment of reactive groups into DNA-like conformations was demonstrated. Indeed, 
the distance independence and sequence fidelity of DNA-templated synthesis allowed the 
simultaneous, one-pot translation of a model library of more than 1,000 templates into the 
corresponding thioedier products, one of which was em:iched by in vitro selection for binding to 
the protein streptavidin and amplified by PGR. 

[00139] As described in detail herein, the generahty of DNA-templated synthesis has been 
further expanded and it has been demonstrated that a variety of chemical reactions can be 
utilized for the constmction of small molecules and in particular, for the first time, DNA- 
templated organometaUic couplings and carbon-carbon bond forming reactions other than 
pyrimidine photodimerization. These reactions clearly represent an important step towards the in 
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vitro evolution of non-natural synthetic molecules by enabling the DNA-templated construction 
of a much more diverse set of structures than has previously been achieved. 
[00140] The ability of DNA-templated synthesis to direct reactions that require a non-DNA- 
linked activator, catalyst or other reagent in addition to the principal reactants has also been 
demonstrated herein. To test the ability of DNA-templated synthesis to mediate such reactions 
without reqiuring structural mimicry of the DNA-templated backbone, DNA-templated reductive 
aminations between an amine-Iinked template (1) and benzaldehyde- or glyoxal-linked reagents 
(3) with millunolar concentrations of NaBHaCN at room temperature in aqueous solutions can be 
performed. Significantly, products formed efficiently when the template and reagent sequences 
were complementary, while control reactions in which the sequence of the reagent did not 
complement that of the template, or hi which NaBHsCN was omitted, yielded no significant 
product (see Figures 13 and 14). Although DNA-templated reductive aminations to generate 
products closely mimicking the structure of double-stranded DNA have been previously reported 
(see, for example, X. Li et al J, Am Chem. Soc. 2002, 124, 746 and Y. Gat et al Biopolynners 
1998, 48, 19), the above results demonstrate that reductive amination to generate structures 
unrelated to the phosphoribose backbone can take place efficiently and sequence-specifically. 
Referring to Figure 15, PNA-templated aide bond formations between amine-linked templates 4 
and 5 and carboxylate-linked reagents 6-9 mediated by l-(3-dimethylaminopropyl)-3- 
ethylcarbodiimide (EDC) and N-hydroxylsulfosuccinimide (sulfo-NHS) to generate amide 
products in good yields at pH 6.0, 2S^C (Figure 15). Product formation was sequence-specific, 
dependent on the presence of EDC, and suprisingly insensitive to the steric encumbrance of the 
amine or carboxylate. Efficient DNA-templated amide formation was also mediated by the 
water-stable activator 4-(4,6-dimethoxy-l,3,5-trizin-2-yl)-4-methylmorphoiinium chloride 
(DMT-MM) instead of EDC and sulfo-NHS (Figures 14 and 15). The efficiency and generality 
of DNA-templated amide bond formation under these conditions, together with the large number 
•of commercially available chiral amines and carboxylic acids, make this reaction an attractive 
candidate in future DNA-templated syntheses of stmcturally diverse small molecule libraries. 
[00141] It will be appreciated that carbon-carbon bond forming reactions are also important in 
both chemical and biological syntheses and thus several such reactions are utilized in DNA- 
templated format. Both the reaction of nitroalkane-linked reagent '(10) with aldehyde-linked 
template (11) (nitro-aldol or Henry reaction) and the conjugate addition of 10 to maleimide- 
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linked template (12) (nitro-Michael addition) proceeded efficiently and with high sequence 
specificity at pH 7.5-8.5, 25°C (Figures 13 and 14). In addition, the sequence-specific DNA-: 
templated Wittig reaction between stabilized phosphorus ylide reagent 13 and aldehyde-linked 
templates 14 or 1 1 provided the corresponding olefin products in excellent yields at pH 6.0-8.0, 
25°C (Figures 13 and 14). Similarly, the DNA templated 1,3-dipolar cycloaddition between 
nitrone-linked reagents 15 and 16 and olefin-linked templates 12, 17 or 18 also afforded products 
sequence specifically at pH 7.5, 25°C (Figures 13 and 14). 

100142] In addition to Hie reactions described above, organometallic coupling reactions can 
also be utilized in the present invention. For example, DNA-templated Heck reactions were 
performed in the presence of water-soluble Pd precatalysts. In the presence of 170 mM 
Na2PdCl4, aryl iodide-linked reagent 19 and a variety of olefin-Unked templates including 
maleimide 12, acrylamide 17, vinyl sulfone 18 or cinnamamide '20 yielded Heck coupling 
products in modest yields at pH 5.0, 25°C (Figures 13 and 14). For couplings with olefins 17, 18 
and 20, adding two equivalents of P(p-S03C6H4)3 per equivalent of Pd prior to template and 
reagent addition* typically increased overall yields by 2-fold. Control reactions containing 
sequence mismatches or lacking Pd precatalyst yielded no product. To our knowledge, the above 
DNA-templated nitro aldol addition, nitro Michael addition, Wittig olefination, dipolar 
cycloaddition, and Heck coupling represent the first reported nucleic-acid templated 
organometallic reactions and carbon-carbon bond forming reactions other than pyrimidine 
photodimerization. 

[00143] It was previously discovered that the same DNA-templated reactions demonstrate 
distance independence, the ability to form product at a rate independent of the number of 
intervening bases between annealed reactants. It was hypothesized (Figure 16a) that distance 
independence arises when the rate of bond formation in the DNA-templated reaction is greater 
than the rate of template-reagent annealing. Although only a subset of chemistries fall into this 
category, any DNA-templated reaction that affords comparable product yields when the reagent 
is aimealed at various distances from the reactive end of the template is of special interest 
becaxise it can be encoded at a variety of template positions. To' evaluate the ability of the DNA- 
templated reactions developed above to take place efficiently when reactants are separated by 
distances relevant to library encoding, the yields of reductive amination, amide formation, nitro- 
aldol addition, nitro-Michael addition, Wittig olefination, dipolar cycloaddition, and Heck 



37 of 88 



wo 02/074929 




PCT/US02/08546 



coupling when zero or ten bases separated annealed reactive groups (Figure 16a, n=0 versus 
n=10) were compared. Among the reactions described above or in previous work, amide bond 
formation, nitro-aldol addition, Wittig olefination. Heck coupling, conjugate addition of thiols to 
maleimides and Sn2 reaction between thiols and a-iodo amides demonstrate comparable product 
formation when reactive groups are separated by zero or ten bases (Figure 16b). These findmgs 
indicate that these reactions can be encoded during synthesis by nucleotides that are distal from 
the reactive end of the template without significantly impairing product formation. 
[00144] In addition to the DNA-templated Sn2 reaction, conjugate addition, vinyl sulfone 
addition, amide bond formation, reductive amination, nitro-aldol (Henry reaction), nitro Michael, 
Wittig olefination, 1,3-dipolar cycloaddition and Heck coupling reactions described directly 
above, a variety of additional reagents can also be utilized in the method of the present invention. 
For example, as depicted in Figure 17, powrerful aqueous DNA-templated synthetic reactions 
including, but not limited to, the Levds acid-catalyzed aldol addition, Mannich reaction, 
Robinson annulation reactions, additions of allyl indium, zinc and tin to ketones and aldehydes, 
Pd-assisted allylic substitution, Diels-Alder cycloadditions, and hetero-Diels-Alder reactions can 
be utilized efficiently in aqueous solvent and are important complexity-biiilding reactions. 
[00145] Taken together, these results expand considerably the reaction scope of DNA- 
templated synthesis. A wide variety of reactions proceeded efficiently and selectively only when 
the corresponding reactants are programmed witia complementary sequences. By augmenting the 
repertoire of known DNA-templated reactions to now include carbon-carbon bond forming and 
organometallic reactions (nitro-aldol additions, nitro-Michael additions, Wittig olefinations, 
dipolar cycloadditions, and Heck couplings) in addition to previously reported amide bond 
formation (see, Schmidt et al Nucleic Acids Res, 1997, 25, 4792; Bruick et al Chem, Biol. 1996, 
3, 49), imine formation (Czlapinski et al. J. Am. Chem. Soc. 2001, J 23, 8618), reductive 
amination (Li et al J. Am. Chem. Soc. 2002, 124, 746; Gat et al. Biopolymers, 1998, 48, 19), Sn2 
reactions (Gartner et al J. Am. Chem. Soc. 2001, 725, 6961; Xu etal. Nat. Biotechnol. 2001, 19, 
148; Herrlein etal J. Am, Chem. Soc. 1995, 117, 10151) conjugate addition of thiols (Gartner et 
al J. Am. Chem. Soc. 2001, 123, 6961), and phosphoester or phosphonamide formation (Orgel et 
al Acc. Chem. Res. 1995, 28, 109; Luther et al Nature, 1998, 396, 245), tiiese results may 
enable the sequence-specific translation of libraries of DNA into libraries of stracturally and 
fimctionally diverse synthetic products. Since minute quantities of templates encoding desired 
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molecules can be amplified by PCR^ the yields of DNA-templated reactions are arguably less 
critical than the yields of traditional synthetic transformations. Nevertheless, many of the 
reactions developed above proceed efficiently. In addition, by demonstrating that DNA- 
templated synthesis in the absence of proteins can direct a large diversity of chemical reactions, 
these findings support previously proposed hypotheses that nucleic acid-templated synthesis may 
have translated replicable information into some of the earliest functional molecules such as 
polyketides, terpenes and polypeptides prior to the evolution of protein-based enzymes. The 
diversity of chemistry shown here to be controllable simply by bringing reactants into proximity 
by DNA hybridization without obvious structural requirements provides an experimental basis 
for these possibilities. The translation of amplifiable information into a wide range of structures 
is a key requirement for applying nature's molecular evolution approach to the discovery of non- 
natural molecxiles with new fimctions. 

[00146] Methods for Exemplary Reactions for Use in DNA-Tefnplated Synthesis: 
(001471 Functionalized templates and reagents were typically prepared by reacting S'-NHi 
terminated oligonucleotides (for template 1), 5'-NH2-(CH20)2 terminated oligonucleotides (for 
all other templates) or 3'-OP03-CH2CH(CH20H)(CH2)4NH2 terminated nuclotides (for all 
reagents) with the appropriate NHS esters (0.1 volumes of a 20 mg/mL solution in DMF) in 0.2 
M sodium phosphate buffer, pH 7.2, 25*^0, 1 h to provide the template and reagent stmctures 
shown in Figures 13 and 15. For amino acid linked reagents 6-9, 3'- 
OP03CH2CH(CH20H)(CH2)4NH2 terminated oligonucleotides in 0.2 M sodium phosphate 
buffer, pH 7.2 were reacted with 0.1 volumes of a 100 mM bis[2- 
(succimmidyloxycarbonyloxy)etibyl]sulfone (BSOCOES, Pierce) solution in DMF for 10 min at 
25°C, followed by 0.3 volumes of a 300 mM amino acid in 300 mM NaOH for 30 min at 25^C. 
[00148] Functionalized templates and reagents were purified by gel filtration using Sephadex 
G-25 followed by reverse-phase HPLC (0.1 triethylammonium acetate-acetonitrile gradient) and 
characterized by MALDI mass spectrometry. DNA templated reactions were conducted under 
the conditions described in Figures 13 and 15 and products were characterized by denaturing 
polyacrylamide gel electrophoresis and MALDI mass spectrometry. 

[00149] The sequences of oligonucleotide templates and reagents are as follows (5' to 3' 
direction, n refers to the number of bases between reactive groups when template and reagent are 
annealed as shown in Figure 16). 1: TGGTACGAATTCGACTCGGG; 2 and 3 matched: 
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GAGTCGAATTCGTACC; 2 and 3 mismatched: GGGCTCAGCTTCCCCA; 4 and 5: 

-GGTAeGAATreGAeTeGGGAATAeeAeeTTr6=9-matchedin^^ — 

matched (n = 0): AATTCGTACC; 6-9 mismatched: TCACCTAGCA; 11, 12, 14, 17, 18, 20: 
GGTACGAATTCGACTCGGGA; 10, 13, 16, 19 matched: TCCCGAGTCGAATTCGTACC; 
10, 13 16, 19 mismatched: GGGCTCAGCTTCCCCATAAT; 15 matched: AATTCGTACC; 15 
mismatched: TCGTATTCCA; template for n = 10 vs. n = 0 comparison: 
TAGCGATTACGGTACGAATTCGACTCGGGA 

[00150] Reaction yields quantitated by denaturing polyacrylamide gel electrophoresis 
followed by ehidixmi bromide staining, UV visnaUzation, and CCD-based densitometry of 
product and template starting material bands. Yield calculations assumed that templates and 
products stained with equal intensity per base; for those cases in which products are partially 
double-stranded during quantitation, changes in staining intensity may result in higher apparent 
yields. 

[00151] Example 3; Development of Exemplary Linkers 

[00152] As will be appreciated by one of ordinary skill in the art, it is frequently useful to 
leave the DNA moiety of the reagents linked to products during reaction development to 
facilitate analysis by gel electrophoresis. The use of DNA-templated synthesis to translate 
libraries of DNA into corresponding libraries of synthetic small molecules suitable for in vitro 
selection, however, requires the development of cleavable linkers connecting reactive groups of 
reagents with their decoding DNA oUgonucleotides. As described below and herein, three 
exemplary types of linkers have been developed (see. Figure 1 8). For reagents with one reactive 
group, it would be desirable to position DNA as a leaving group to the reactive moiety. Under 
this "autocleavable" linker strategy, the DNA-reactive group bond is cleaved as a natural 
consequence of the reaction. As but one example of this approach, a fluorescent Wittig 
phosphorane reagent (14, referring to Figure 19) was synthesized in which the decoding DNA 
oligonucleotide was attached to one of the aryl phosphine groups (see. Figure 19, left). DNA- 
templated Wittig reaction with aldehyde-linked templates resulted in the nearly quantitative 
transfer of the fluorescent group from the Wittig reagent to the template and the concomitant 
liberation of the alkene product from the DNA moiety of the reagent. Additionally, reagents 
bearing more than one reactive group can be linked to their decoding DNA oligonucleotides 
through one of two additional linker strategies. In the "scarless" linker strategy, DNA-templated 



40 of 88 



wo 02/074929 




PCT/US02/08546 



reaction of one reactive group is followed by cleavage of the linker attached through a second 
reactive group to yield products without leaving behind additional chemical functionality. For 
exanoiple, a series of amino acid reagents were syntheisized which were connected through a 
carbamoyletiiylsulfone linker to their decoding DNA oligonucleotides (Figure 19, center). 
Products of DNA-templated amide bond formation using these amino acid reagents were treated 
with aqueous alkaline buffer to effect the quantitative elimination and spontaneous 
decarboxylation of the carbamoyl group. The product of leaving this scarless linker is therefore 
the cleanly transferred amino acid moiety. In yet other embodiment of the invention, a third 
linker strategy, a "useful scar" may be utilized on the theory that it may be advantageous to 
introduce useful chemical groups as a consequence of linker cleavage. In particular, a "useful 
scar" can be functionalized in subsequent steps and is left behind following linker cleavage. For 
example, amino acid reagents linked through 1,2-diols to their decoding DNA oligonucleotides 
were generated. Following amide bond formation, this linker was quantitatively cleaved by 
oxidation with NaI04 to afiford products bearing useful aldehyde groups (see. Figure 19, right). 
In addition to the linkers described directiy above, a variety of additional linkers can be utilized. 
For example, as shown in Figure 20, a thioester linker can be generated by carbodiimide- 
mediated coupling of thiol-terminated DNA with carboxylate-containing reagents and can be 
cleaved with aqueous base. As the carboxylate group provides entry into the DNA-templated 
amide bond formation reactions described above, this linker would liberate a "useful scar'' when 
cleaved (see, Figure 20). Alternatively, the thioester linker can be used as an autocleavable 
linker during an amine acylation reaction in the presence of Ag(I) cations (see, Zhang et al J, 
Am. Chem. Soc. 1999, 727, 331 1-3320) since the thiol-DNA moiety of the reagent is liberated as 
a natural consequence of the reaction. It will be appreciated that a thioether linker that can be 
oxidized and eliminated at pH 11 to liberate a vinyl sulfone can be utilized as a "useful scar" 
linker. As demonstrated herein, the vinyl sulfone group serves as the substrate in a limnber of 
subsequent DNA-templated reactions. 

[00153] Example 4: Exemplary Reactions in Organic Solvents: 

[00154] As demonstrated herein, a variety of DNA-templated reactions can occm: in aqueous 
media. It has also been demonstrated, as discussed below, that DNA-templated reactions can 
occur in organic solvents, thus greatly expanding the scope of DNA-templated synthesis. 
Specifically, DNA templates and reagents have been complexed with long chain 
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tetraallcylammonitim cations (see, Jost et al Nucleic Acids Res. 1989, 17, 2143; Mel'mkov et ah 
Langmuir, 1999, 75, 1923-1928) to enable quantitative^ dissolution bflreactioa componente m 
anhydrous organic solvents including CH2CI2, CHCI3, DMF and MeOH. Surprisingly, it was 
found that DNA-templated synthesis can indeed occur in anhydrous organic solvents with high 
sequence selectivity. Depicted in Figure 21 are DNA-templated amide bond formation 
reacations in which reagents and templates are complexed with dimethyldidodecylaramonium 
cations either in separate vessels or after preannealing in water, lyophiUzed to dryness, dissolved 
in CH2CI25 and mixed together. Matched, but not mismatched, reactions provided products both 
when reactants were preannealed in aqueous solution and when they were mixed for the first 
time in CH2C1^ (see. Figure 21). DNA-templated amide formation and Pd-mediated Heck 
coupling in anhydrous DMF also proceeded sequence-specifically. Clearly, these observations 
of sequence-specific DNA-templated synthesis in organic solvents implies the presence of at 
least some secondary structure within tetraalkylanamonium-complexed DNA in organic media, 
and should enable DNA receptors and catalysts to be evolved towards stereoselective binding or 
catalytic properties in organic solvents. Specifically, DNA-templated reactions that are known to 
occur in aqueous media, including conjugate additions, cycloadditions, displacement reactions, 
and Pd-mediated couplings can also be performed in organic solvents. In certain other 
embodiments, reactions in organic solvents may be utilized that are inefficient or impossible to 
perform in water. For example, while Ru-catalyzed olefin metathesis in water has been reported 
by Grabbs and co-workers (see, Lynn et al J, Am. Chem. Soc. 1998, 120, 1627-1628; Lynn et ah 
J. Am, Chem, Soc. 2000, 122, 6601-6609; Mohr et al Orgaiwmetallics 1996, 75, 43 17-4325), the 
aqueous metathesis system is extremely fi.mctional group sensitive. The functional group 
tolerance of Ru-catalyzed olefin metathesis in organic solvents, however, is significantly more 
robust. Some exemplary reactions to utilize in organic solvents include, but are not limited to 
1,3 -dipolar cycloaddition between nitrones and olefins which can proceed through transition 
states that are less polar than ground state starting naaterials. 

[00155] As detailed above, the generahty of DNA-templated synthesis has been established by 
performing several distinct DNA-templated reaction types, none of which are limited to 
producing stnictures that resemble the natural nucleic acid backbone, and many of which are 
highly usefial carbon-carbon bond forming or complexity-building synthetic reactions. It has 
been shown that the distance independence of DNA-templated synthesis allows different regions 
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of a DNA template to each encode different synthetic reactions. DNA-templated synthesis can 
m?<intfl^n Sequence BdeUty^even in¥fibrary formatln whicS more tiian IjQOD templates and 
1,000 reagents react simultaneously in one pot. As described above and below, linker strategies 
have been developed, which together with the reactions developed as described above, have 
enabled the jBrst midti-step DNA-templated synthesis of simple synthetic small moleciiles. 
Additionally, the sequence-specific DNA-templated synthesis in organic solvents has been 
dehionstrated, further expanding the scope of this approach. 

[00156] Example 5: Synthesis of Exemplary Compomids and Libraries of Compounds: 
[00157] A) Synthesis of a Polycarbamate Library: One embodiment of the strategy 
described above is the creation of an amplifiable polycarbamate hbrary. Of the sixteen possible 
dinucleotides used to encode the library, one is assigned a start codon function, and one is 
assigned to serve as a stop codon. An artificial genetic code is then created assigning each of the 
up to 14 remaining dinucleotides to a different monomer. For geometric reasons one monomer 
actually contains a dicarbamate containing two side chains. Within each monomer, the 
dicarbamate is attached to the correspondmg dinucleotide (analogous to a tRNA anticodon) 
through a silyl enol ether linker which liberates the native DNA and the free carbamate upon 
treatment with fluoride. The dinucleotide moiely exists as the activated 5'-2-methylimidazole 
phosphate, that has been demonstrated (Inoue et al J. Mol Biol 162:201, 1982; Rembold et al 
J, Mol Evol 38:205, 1994; Chen et al J. Mol Biol 181:271, 1985; Acevedo etal J. Mol Biol 
197:187, 1987; Inoue et al J. Am. Chem, Soc, 103:7666, 1981; each of which is incorporated 
herein by reference) to serve as an excellent leaving group for template-directed oligomerization 
of nucleotides yet is relatively stable under neutral or basic aqueous conditions (Schwartz et al 
Science 228:585, 1985; incorporated herein by reference). The dicarbamate moiety exists in a 
cycUc form linked through a vinyloxycarbonate linker. The vinylcarbonate group has been 
demonstrated to be stable in neutral or basic aqueous conditions (Olofson et al Tetrahedron Lett, 
18:1563, 1977; Olofson ^zZ. Tetrahedj^on Lett. 18:1567, 1977; Olofson etal Tetrahedron Lett 
18:1571, 1977; each of which is incorporated herein by reference) and further has been shown to 
provide carbamates in very high yields upon the addition of amines (Olofson et al Tetrahedron 
Lett. 18:1 563, 1 977; incorporated herein by reference). 
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[00158] When attacked by an amine from a nascent polycarbamate chain, the vinyl carbonate 
linker, driven by the aromatization of w-cresol, liberates a free amine. This free amine 
subsequently serves as the nucleophile to attack the next vinyloxycarbonate, propagating the 
polymerization of the growing carbamate chain. Such a strategy minimizes the potential for 
cross-reactivity and bi-directional polymerization by ensuring that only one nucleophile is 
present at any time duriag polymerization. 

[00159] Using the monomer described above, artificial translation of DNA into a 
polycarbamate can be viewed as a three-stage process. In the jBrst stage, single stranded DNA 
templates encoding the library are used to guide the assembly and polymerization of the 
dinucleotide moieties of the monomers, terminating with the "stop" monomer which possesses a 
3 'methyl ether instead of a 3'hydroxyl group (Figure 22). 

[00160] Once the nucleotides have assembled and polymerized into double-stranded DNA, 
the "start" monomer ending in a o-nitrobenzylcarbamates is photodeprotected to reveal the 
primary amine that initiates carbamate polymerization. Polymerization proceeds in the 5' to 3* 
direction along the DNA backbone, with each nucleophilic attack resulting in the subsequent 
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unmasking of a new amine nucleophile.- Attack of the "stop" monomer liberates an acetamide 
rather than an amine^ffiereby, teraunafion 

stage exists in a stable double-stranded form, variables such as temperature and pH may be 
explored to optimize polymerization efficiency. 

[00161] Following polymerization the polycarbamate is cleaved IBrom the phosphate backbone 
of the DNA upon treatment with fluoride. Desilylation of the enol ether linker and the 
elimination of the phosphate driven by the resulting release of phenol provides the 
polycarbamate covalently linked at its carboxy terminus to its encoding single-stranded DNA 
(Figure 24). 

[00162] At this stage the polycarbamate may be completely liberated from the DNA by base 
hydrolysis of the ester linkage. The liberated polycarbamate can be purified by HPLC and 
retested to verify that its desired properties are intact. The free DNA can be amplified using 
PGR, mutated with error-prone PGR (Gadwell et al PCR Methods AppL 2:28, 1992; 
incorporated herein by reference) or DNA shuffling (Stemmer Proc. Natl. Acad. ScL USA 
91:10747, 1994; Stemsnsx Nature 370:389, 1994; U.S. Patent 5,811,238, issued September 22, 
1998; each of which is incorporated herein by reference), and/or sequenced to reveal the primary 
structure of the polycarbamate. 

[00163] Synthesis of monomer units. After the monomers are synthesized, the assembly and 
polymerization of the monomers on the DNA scaffold should occur spontaneously. Shikimic 
acid 1, available commercially, biosynthetically (Davis Adv. EnzymoL 16:287, 1955; 
incorporated herein by reference), or by short syntheses from D-majonose (Fleet et al. J. Chem. 
Soc„ Perkins Trans. 7 905, 1984; Harvey et al Teti^ahedron Lett. 32:4111, 1991; each of which 
is incorporated herem by reference), serves as a convenient starting point for the monomer 
synthesis. The syn hydroxy! groups are protected as the ^-methoxybenzylidene, and remaining 
hydroxyl group as the /e/t-butyldimethylsilyl ether to afford 2. The carboxylate moiety of the 
protected shikimic acid is then reduced completely by LAH reduction, tosylation of the resulting 
alcohol, and further reduction with LAH to provide 3. 
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[00164] Commercially available and synthetically accessible N-protected amino acids serve as 
the starting materials for the dicarbamate moiety of each monomer. Reactive side chains are 
protected as photolabile ethers, esters, aceteds, carbamates, or thioethers. Following chemistry 
previously developed (Cho et at. Science 261:1303, 1993; incorporated herein by reference), a 
desired amino acid 4 is converted to the corresponding amino alcohol 5 by mixed anhydride 
formation with isobutylchloroformate followed by reduction with sodium borohydride. The 
amino alcohol is then converted to the activated carbonate by treatment with 
nitrophenylchlorofonnate to afford 6, which is then coupled to a second amino alcohol 7 to 
provide, following hydroxyl group silylation and FMOC deprotection, carbamate 8. 




.NHFMOC 



[00165] Coupling of carbamate 8 onto the shikimic acid-derived linker proceeds as follows. 
The allylic hydroxyl group of 3 is deprotected with TBAF, treated with triflic anhydride to form 
the secondary triflate, then displaced with aminocarbamate 8 to afford 9. Presence of the vinylic 
methyl group in 3 should assist in minimizing the amoimt of undesired product resulting from 
Sn2' addition (Magid Tetrahedron 36:1901, 1980; incorporated herein by reference). Michael 
additions of deprotonated carbamates to a,P-unsaturated esters have been well documented 
(CoUado et al. Tetj^ahedron Lett 35:8037, 1994; Hirama et al J. Am, Chem. Soc, 107:1797, 
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1985; Nagasaka et al Heterocycles 29:155, 1989; Shishido et al J. Chem, Soc. Perkins Trans. I 
"993VT987rHirama et alTTIeterdcycM^^ 

reference). By analogy, the secondary amine is protected as the o-nitrobenzyl carbamate 
(NBOC), and the resulting compound is deprotonated at the carbamate nitrogen. This 
deprotonation can typically be performed with either sodium hydride or potassium tert- 
butyloxide (Collado et al Tetrahedron Lett. 35:8037, 1994; Hirama et al J, Attl Chem, Soc. 
107:1797, 1985; Nagasaka et al Heterocycles 29:155, 1989; Shishido et al J. Chem, Soc. 
Perkins Trans. I 993, 1987; Hirama et al Heterocycles 28:1229, 1989; each of which is 
incorporated herein by reference), although other bases may be utilized to minimize 
deprotonation of the nitrobenzylic protons. Additions of the deprotonated carbamate to a^P- 
unsaturated ketone 10, followed by trapping of the resulting enolate with TBS CI, should afford 
silyl enol ether 11. The previously found stereoselectivity of conjugate additions to 5-substituted 
enones such as 10 (House et al J. Org. Chem. 33:949, 1968; Still et al Tetrahedron 37:3981, 
1981; each of which is incorporated herein by reference) suggests that preferential formation of 
11 over its diastereomer. Ketone 10, the precursor to the fluoride-cleavable carbamate- 
phosphate linker, may be synthesized from 2 by one pot decarbo^cylation (Barton et al 
Tetrahedron 41:3901, 1985; incorporated herein by reference) followed by treatment with 
TBAF, Swem oxidation of the resulting alcohol to afford 12, deprotection with DDQ, selective 
nitrobenzyl ether formation of the less-hindered alcohol, and reduction of the a-hydroxyl group 
with samariinn iodide (Molander In Organic Reactions^ Paquette, Ed. 46:21 1, 1994; incorporated 
herein by reference). 
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[00166] The /7--methoxyben2ylidiene group of 11 is transformed into the a-hydroxy PMB ether 
using sodium cyanoborohydride and TMS chloride (Johansson etaL J, Chem. Soc, PerJdn Trans. 
I 2371, 1984; incorporated herein by reference) and the TES group deprotected with 2% HF 
(conditions that should not affect the TBS ether (Boschelli et al Tetrahedron Lett. 26:5239, 
1985; incorporated herein by reference)) to provide 13. The PMB group, following precedent 
(Johansson et al J, Ghent Soc. Ferkin Tram. 72371, 1984; Sutherlin et al Tetrahedron Lett. 
34:4897, 1993; each of v^ch is incorporated herein by reference), should remain on the more 
hindered secondary alcohol. The two free hydrQxyl groups may be macrocyclized by very slow 
addition of 13 to a solution of jc7-nitrophenyl chloroformate (or another phosgene analog), 
providing 14. The PMB ether is deprotected, and the resulting alcohol is converted into a triflate 
and eliminated under kinetic conditions with a sterically hindered base to afford 
vinyloxycarbonate 15. Photodeprotection of the nitrobenzyl either and nitroben2yl carbamate 
yields alcohol 16. 
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[00167] The monomer synthesis is completed by the sequential coupling of Ihree components. 
Chlorodiisopropylaminophosphine 17 is synthesized by the reaction of PCI3 with 
diisopropylamine (King et al J. Org. Chem. 49:1784, 1984; incorporated herein by reference). 
Resin-bound (or 3'-o-nitrobenzylether protected) nucleoside 18 is coupled to 17 to afford 
phosphoramidite 19. Subsequent coupling of 19 with the nucleoside 20 (Inoue et al J. Am. 
Chem. Soc. 103:7666, 1981; incorporated herein by reference) provides 21. Alcohol 16 is then 
reacted with 21 to yield, after careful oxidation using MCPBA or I2 followed by cleavage from 
the resin (or photodeprotection), the completed monomer 22. This strategy of sequential 
coupling of 17 with alcohols has been successfully used to generate phosphates bearing three 
different alkoxy substituents in excellent yields (Bannwarth et al Helv, Chim. Acta 70:175, 
1987; incorporated herein by reference). 
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[00168] The unique start and stop monomers used to initiate and terminate carbamate 
polymerization may be synthesized by simple modification of the above scheme. 
[00169] B) Evolvable Functionalized Peptide-Nucleic Acids (PNAs): In another 
embodiment an amplifiable peptide-nucleic acid library is created. Orgel and co-workers have 
demonstrated that peptide-nucleic acid (PNAs) oligomers are capable of eflBcient polymerization 
on complementary DNA or RNA templates (B6hler et al Nature 376:578, 1995; Schmidt et al 
Nitcl Acids Res. 25:4792, 1997; each of which is incorporated herein by reference). This 
finding, together with the recent synthesis and characterization of chiral peptide nucleic acids 
bearing amino acid side chains (Haaima etaL Angew. Chem, Int. Ed Engl 35:1939-1942, 1996; 
Piischl et al Tetrahedron Lett. 39:4707, 1998; each of which is incorporated herein by 
reference), allows the union of the polymer backbone and the growiag nucleic acid strand into a 
single structure. In this example, each template consists of a DNA hairpin terminating in a 5' 
amino group; the solid-phase and solution syntlieses of such molecules have been previously 
described (Uhlmann et al Angew, Chem. Int. Ed, Engl 35:2632, 1996; incorporated herein by 
reference). Each extension monomer consists of a PNA trimer (or longer) bearing side chains 
containing functionality of interest. An artificial genetic code is written to assign each 
trinucleotide to a dijfferent set of side chains. Assembly, activation (with a carbodiimide and 
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appropriate leaving group, for example), and polymerization of the PNA dimers along the 
"co^lem^iteE^DNXltCTa^^ 

polymer (Figure 20). Choosing a **stop" monomer with a biotinylated N-temiinus provides a 
convenient way of terminating the extension and purifying fiill-lengfh polymers. The resulting 
polymers, covalently linked to their encoding DNA, are ready for selection, sequencing, or 
mutation. 

[00170] The experimental approach towards implementing an evolvable fimctionalized 
peptide nucleic acid library comprises (i) improving and adapting known chemistry for the high 
efficiency template-directed polymerization of PNAs; (ii) defining a codon fonnat (length and 
composition) suitable for PNA coupling of a number of diverse monomers on a complementary 
strand of encoding DNA jfree from significant iudSdelity, framshifting, or spurious initiation of 
polymerization; (iii) choosing an initial set of side chains defining our new genetic code and 
synthesizing corresponding monomers; and (iv) subjecting a library of fimctionalized PNAs to 
cycles of selection, amplification, and mutation and characterizing the resulting evolved 
molecules to understand the basis of their novel activities. 

[00171] (i) Improving coupling chemistry: While Orgel and coworkers have reported 
template-directed PNA polymerization, reported yields and ntimber of successful couplings are 
significantly lower than would be desired. A promising route towards improving this key 
coupling process is exploring new coupling reagents, temperatures, and solvents which were not 
previously investigated (presumably because previous efforts focused on conditions which could 
have existed on prebiotic earth). The development of evolvable fimctionalized PNA polymers 
involves employing activators (DCC, DIG, EDC, HATU/DIEA, HBTU/DIEA, ByBOP/DIEA, 
chloroacetonitrile), leaving groups (2-methylimidazole, imidazole, pentafluorophenol, phenol, 
thiophenol, trifluoroacetate, acetate, toluenesulfonic acids, coenzyme A, DMAP, ribose), 
solvents (aqueous at several pH values, DMF, DMSO, chloroform, TFE), and temperature (0°C, 
4**C, 25°C, 37°C, SS^C) in a large combinatorial screen to isolate new coupling conditions. Each 
well of a 384-well plate is assigned a specific combination of one activator, leaving group, 
solvent, and temperature. Solid-phase synthesis beads covalently linked to DNA hairpin 
templates are placed in each well, together with a fluorescently labeled PNA monomer 
complementary to the template. A successfiil coupling event results in the covalent linking of 
the fluorophore to the beads (Figure 26); undesired non-templated coupling can be distinguished 
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by control reactions Ayith mismatched monomers. Following bead washing and cleavage of the 
product jfrom soHdTuppor^each well~is assayed^thaHuorescence plate readafi 
[00172] (ii) Defining a codon format: While Nature has successfiilly employed a triplet 
codon in protein biosynthesis, a new polymer assembled under very different conditions without 
the assistance of en2ymes may require an entirely novel codon format. Frameshifting may be 
remedied by lengthening each codon such that hybridizing a codon out of frame guarantees a 
mismatch (for example, by starting each codon with a G and by restricting subsequent positions 
in the codon to T, C, and A). Thermodynamically, one would also expect fidelity to improve as 
codon length increases to a certain point Codons that are excessively long, however, will be 
able to hybridize despite mismatched bases and moreover complicate monomer synthesis. An 
optimal codon length for high fidelity artificial translation can be defined using an optimized 
plate-based combinatorial screen developed above. The length and composition of each codon in 
the template is varied by sohd-phase synthesis of the appropriate DNA hairpin. These template 
hairpins are then allowed to couple with fluorescently labeled PNA monomers of varying 
sequence. The ideal codon format allows only monomers bearing exactly complementary 
sequences to couple with templates, even in the presence of mismatched PNA monomers (which 
are labeled dififerentiy to facilitate assaying of matched versiis mismatched coupling). Triplet 
and quadruplet codons in which two bases are varied among A, T, and C while the remaining 
base or bases are fixed as G to ensure proper registration dxiring polymerization are first studied. 
[00173] (iii) Writing a new genetic code: Side chains are chosen which provide interesting 
fimctionality not necessarily present in natural biopolymers, which are synthetically accessible, 
and which are compatible with coupling conditions. For example, a simple genetic code which 
might be used to evolve a Ni"*"^ chelating PNA consists of a variety of protected carboxylate- 
bearing side chains as well as a set of small side chains to equip piolymers with conformational 
flexibility and structural diversity (Figure 27). Successfiil selection of PNAs capable of binding 
Ni^^ with high affinity could be followed by an expansion of this genetic code to include a 
fluorophore as well as a fluorescent quencher. The resulting polymers could then be evolved 
towards a fluorescent Ni***^ sensor which possesses diflFerent fluorescent properties in the absence 
or presence of nickel. Replacing the fluorescent side chain with a photocaged one may allow the 
evolution of polymers that chelate Ni^^ in the presence of certain wavelengths of light or which 
release Ni"*"^ upon photolysis. These simple examples demonstrate the tremendous flexibility in 
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potential chemical properties of evolvable unnatural molecules conferred by the freedom to 
~incoipofSe"sjmt^^ 
machinery. 

[00174] (iy) Selecting for desired urmatural polymers: Many of the methods developed for 
the selection of biological molecules can be applied to selections for evolved PNAs with desired 
properties. Like nucleic acid or phage-display selections, libraries of unnatural polymers 
generated by the DNA-templated polymerization methods described above are self-tagged and 
therefore do not need to be spatially separated or synthesized on pins or beads. Ni binding 
PNA may be done simply by passing the entire library resulting from translation or a random 
oligonucleotide through commercially available Ni-NTA ("His-Tag") resin precharged with 
nickel. Desired molecules bind to the resin and are eluted with EDTA. Sequencing these PNAs 
after several cycles of selection, mutagenesis, and amplification reveals which of the initially 
chosen side chains can assemble together to form a Ni"^^ receptor. In addition,' the isolation of a 
PNA Ni^^ chelator represents the PNA equivalent of a histidine tag which may prove useful for 
the purification of subsequent unnatural polym^ers. Later efforts will involve more ambitious 
selections. For example, PNAs that fluoresce in the presence of specific ligands xnsy be selected 
by FACS sorting of translated polymers linked through their DNA templates to beads. Those 
beads that fluoresce in the presence^ but not in the absence, of the target ligand are isolated and 
characterized. Finally, the use of a biotinylated "stop" monomer as described above allows for 
flie direct selection for the catalysis of many bond-foraiing or bond-breaking reactions. Two 
examples depicted in Figure 28 outline a selection for a fimctionalized PNA that catalyzes the 
retroaldol cleavage of fructose 1,6-bisphosphate to glyceraldehyde 3-phosphate and 
dihydroxyacetone phosphate, an essential step in glycolysis, as well as a selection for PNA that 
catalyzes the converse aldol reaction. 

[00175] C) Evolvable Libraries of Small Molecules: In yet another embodiment of the 
present invention, the inventive methods are used in preparing amplifiable and evolvable 
unnatural nonpolymeric molecules including synthetic drag scaffolds. Nucleophilic or 
electrophilic groups are individually unmasked on a small molecule scaffold attached by simple 
covalent linkage or through a common solid support to an encoding oligonucleotide template. 
Electrophilic or nucleophilic reactants linked to short nucleic acid sequences are hybridized to 
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the corresponding templates. Sequence-specific reaction with the appropriate reagent takes place 
by proximity catalysis (Figure 29). 

[00176] Following synthetic fimctionalization of all positions in a maimer determined by the 
sequence of the attached DNA (Figures 30 & 31), the resultrng encoded beads may be subjected 
to wide range of biological screens which have been developed for assaying compounds on resin. 
(Gordon et al J. Med. Chem. 37:1385, 1994; Gallop et al J, Med. Chenu 37:1233-1251, 1994; 
each of which is incorporated herein by reference) 

[00177] Encoding DNA is cleaved from each bead identified in the screen and subjected to 
PGR, mutagenesis, sequencing, or homologous recombination before reattachment to a solid 
support. Ultimately, this system is most flexible when the encoding DNA is directly linked to 
the combinatorial synthetic scaffold without an intervening bead. In this case, entire libraries of 
compounds may be screened or selected for desired activities, their encoding DNA liberated, 
amplified, mutated, and recombined, and new compounds synthesized all in a small series of 
one-pot, massively parallel reactions. Without a bead support, however, reactivities of 
hybridized reactants must be highly efiBcient since only one template molecule directs the 
synthesis of the entire small molecule. 

[00178] The development of evolvable synthetic small molecule libraries relies on chemical 
catalysis provided by the proximity of DNA hybridized reactants. It will be appreciated that 
accq>table distances between hybridized reactants and unmasked reactive groups must first be 
defined for efiBcient DNA-templated fimctionalization by hybridizing radiolabeled electrophiles 
(activated esters in out first attempts) attached to short oligonucleotides at varying distances fiom 
a reactive nucleophile (a primary amine) on a strand of DNA. At given timepoints, aliquots are 
subjected to gel electrophoresis and autoradiography to monitor the course of the reaction. 
Plotting the reaction as a fimction of tlie distance (in bases) between the nucleophile and 
electrophile will define an acceptable distance window within which proximity-based catalysis 
of a DNA-hybridized reaction can take place. The width of this window will determine the 
number of distinct reactions we can encode on a strand of DNA (a larger window allows more 
reactions) as well as the nature of the codons (a larger window is required for longer codons) 
(Figure 32). 

[00179] Once acceptable distances between fimctional groups on a combinatorial synthetic 
scaffold and hybridizes reactants is determined, the codon format is determined. The 
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nonpolymeric nature of small molecule synthesis simplifies codon reading as frameshifting is not 
an issue and relatively large codons may be used to ensure that each set of reactants hybridizes 
only to one region of the encoding DNA strand. 

[00180] Once the distance of the linJcer between the functional group and synthetic small 
molecule scaffold and the codon format have been determined, one can synthesize small 
molecules based on a small molecule scaffold such as the cephalosporin scaffold shown in 
Figure 31. The primary amine of 7-aminocephalosporanic acid is first protected using FMOC- 
Cl, and then the acetyl group is hydrolyzed by treatment with base. The encoding DNA template 
is then attached through an amide linkage using EDC and HOBt to the carboxylic acid group. A 
transfer molecule with an anti-codon capable of hybridizing to the attached DNA template is 
then allowed to hybridize to the template. The transfer molcule has associated with it through a 
disulfide linkage a primary amine bearing Ri. Activation of the primary hydroxyl group on the 
cephalosporin scaffold witii DSC following treatment with TCEP affords the amine covalently 
attached to the scaffold through a carbamate linkage. Further treatment with another transfer 
unit having an anoiino acid leads to fimctionaliztion of the deprotected primary amine of the 
cephalosporin scaffold. Cephalosporin-like molecules sjmthesized in this maimer may then be 
selected, amplified, and/or evolved using the inventive methods and compositions. The DNA 
template may be diversified and evolved using DNA shuffling. 

[00181] D) Multi-Step Small Molecule Synthesis Programmed by DNA Templates: 

Molecular evolution requires the sequence-specific translation of an amplifiable information 
carrier into the stmctures of the evolving molecules. This requirement has limited the types of 
molecules that have been directly evolved to two classes, proteins and nucleic acids, because 
only these classes of molecules can be translated from nucleic acid sequences. As described 
generally above, a promising approach to the evolution of molecules other than proteins and 
nucleic acids uses DNA-templated synthesis as a method of translating DNA sequences into 
synthetic small molecules. DNA-templated synthesis can direct a wide variety of powerfiil 
chemical reactions with high sequence-specificity and without requiring structural naimicry of 
the DNA backbone. The application of this approach to synthetic molecules of useful 
complexity, however, requires the development of general methods to enable the product of a 
DNA-templated reaction to undergo ' subsequent DNA-templated transformations. The first 
DNA-templated multi-step small molecule syntheses is described in detail herein. Together with 
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recent advances in the reaction scope of DNA-templated synthesis, these findings set the stage 
for the in vitro evolution of synthetic small molecule libraries, 

[00182] Multi-step DNA-templated small molecule synthesis faces two major challenges 
beyond those associated with DNA-templated synthesis in general. First, the DNA used to direct 
reagents to appropriate templates must be removed fi:om the product of a DNA-templated 
reaction prior to subsequent DNA-templated synthetic steps in order to prevent undesired 
hybridization to the template. Second, multi-step synthesis often requires the purification and 
isolation of intermediate products^ yet common methods used to purify and isolate reaction 
products are not appropriate for multi-step synthesis on the molecular biology scale. To address 
these challenges, three distinct strategies were implemented in solid-phase organic synthesis, for 
linking chemical reagents with their decoding DNA oligonucleotides and two general approaches 
for product purification after any DNA-templated synthetic step were developed. 
[00183] When possible, an ideal reagent-oligonucleotide linker for DNA-templated synthesis 
positions the oligonucleotide as a leaving group of the reagent. Under this "autocleaving" linker 
strategy, the oligonucleotide-reagent bond is cleaved as a natural chemical consequence of the 
reaction (Fig. 33). As the first example of this approach applied to DNA-templated chemistry, ai 
dansylated Witdg phosphorane reagent (1) was synthesized in which the decoding DNA 
oligonucleotide was attached to one of the aryl phosphine groups (I. Hughes, Tetrahedron Lett, 
1996, 57, 7595). DNA-templated Wittig olefination (as described above) with aldehyde-linked 
template 2 resulted in the eflBcient transfer of the fluorescent dansyl group from the reagent to 
the template to provide olefin 3 (Fig. 33). As a second example of an autocleaving linker, DNA- 
linked thioester 4, when activated with Ag(I) at pH 7.0 (Zhang et al J. Am. Chem. Sac, 1999, 
727, 3311) acylated amino-terminated template 5 to afford amide product 6 (Fig. 33). 
Ribosomal protein biosynthesis uses aminoacylated tRNAs in a similar autocleaving linlcer 
format to mediate RNA-templated peptide bond formation. To purify desired products away 
firom unreacted reagents and fi-om cleaved oligonucleotides following DNA-templated reactions 
using autocleaving linkers, biotinylated reagent oligonucleotides and washing cmde reactions 
with streptavidin-linked magnetic beads (Fig. 34) were utilized. Although this approach does not 
separate reacted templates from unreacted templates, unreacted templates can be removed in 
subsequent DNA-templated reaction and purification steps (see below). 
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[00184] Reagents beaidng more than one functional group can be linked to their decoding 
DNA oligonucleotides througji a second and third linker strategies. In the "scarleiss linker" 
approach, one functional group of the reagent is reserved for DNA-templated bond fotmation, 
while the second functional, group is used to attach a linker that can be cleaved without 
introducmg additional unwanted chemical functionality. DNA-templated reaction is followed by 
cleavage of the linker attached through the second functional group to afford desired products 
(Fig. 33). For example, a series of aminoacylation reagents such as (D)-Phe derivative 7 were 
synthesized in which the a-amine is connected through a carbamoylethylsulfone linker (Zarling 
et al J. Immunology 1980, 124, 913) to its decoding DNA oligonucleotide. The product (8) of 
DNA-templated amide bond formation (as described herein) using this reagent and an amine- 
terminated template (5) vvas treated with aqueous base to ejffect the quantitative elimination and 
spontaneous decarboxylation of the linker, aflFording product 9 containing the cleanly transferred 
amino acid group (Fig. 33). This sulfone linker is stable in pH 7.5 or lower buffer at 25 for 
more than 24 h yet undergoes quantitative cleavage when e^qjosed to pH 11.8 buffer for 2 h at 37 

[00185] In some cases it may be advantageous to introduce new chemical groups as a 
consequence of linker cleavage. Under a third linker strategy, linker cleavage generates a 
"usefiil scar" that can be fimctionalized in subsequent steps. As an example of this class of 
linker, amino acid reagents such as the (L)-Phe derivative 10 were generated linked through 1,2- 
diols (Fmchart et al TetJ^ahedron Lett 1999, 40, 6225) to their decoding DNA oligonucleotides. 
Following DNA-templated amide bond formation with amine terminated template (5), this linker 
was quantitatively cleaved by oxidation with 50 mM aqueous NaI04 at pH 5.0 to afford product 
12 containing an aldehyde group appropriate for subsequent functionalization (for example, in a 
DNA-templated Wittig olejBnation, reductive amination, or nitrolaldol addition (Fig. 33). 
[00186] Desired products generated from DNA-templated reactions using the scarless or 
useful scar linkers can be readily purified using biotinylated reagent oligonucleotides (Fig. 34). 
Reagent ohgonucleotides together with desired products are first captured on streptavidin-linked 
magnetic beads. Any imreacted template bound to reagent by base pairing is removed by 
washing the beads with buffer containing 4 M guanidinium chloride. Biotinylated molecules 
remain bound to the streptavidin beads under these conditions. Desired product is then isolated 
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in pme fonn by eluting the beads with linker cleavage buffer (in the examples above, either pH 
i 1 bir NaI04-c6htaiiiing bujfter), wJflie reacted and iinreacted reagents remain bound to the beads, 
[00187] Integrating the recently expanded repertoire of synthetic reactions compatible with 
DNA-templated synthesis and the linker strategies described above, multi-step DNA-templated 
small moleciile syntheses can be conducted. 

[00188] In one embodiment, a solution phase DNA-templated synthesis of a non-natural 
peptide library is described generally belovyr and is shown generally in Figure 35. As shown in 
Figure 35, to generate the initial template pool for the library, thirty synthetic biotinylated 5'- 
amino oligonucleotides of the sequence format shown in Figure 35 are acylated with one of 
thirty different natural or unnatural amino acids using standard EDC coupling procedures. Four 
bases representing a "codon" within each eimino acylated primer are designated the identity of 
the side chain (Ri). The "genetic code" for these side chains are protected with acid labile 
protecting groups similar to those commonly used in peptide synthesis. The resulting thirty 
amino acylated DNA primers are annealed to a template DNA oligonucleotide library generated 
by automated DNA synthesis. Primer extension with a DNA polymerase followed by strand 
denaturation and purification with streptavidin-Iinked magnetic beads yield the starting template 
library (see. Figure 35). As but one general example, a solution phase DNA-templated synthesis 
of a non-natural peptide library is depicted in Figure 36. The template library is subjected to 
three DNA-templated peptide bond formation reactions using amino acid reagents attached to 
10-base decodiag DNA oligonucleotides through the sulfone linker described above. Products of 
each step are purified by preparative denaturing polyacrylamide gel electrophoresis prior to 
linker cleavage if desired, although this may not be necessary. Each DNA-linked reagent can be 
synthesized by coupling a 3 '-amino terminated DNA oligonucleotide to the encoded amino acid 
through the bis-NHS carbonate derivative of the sulfone linker as shown in Figure 37. Codons 
are again chosen so that related codons encode chemically similar amino acids. Following each 
peptide bond formation reaction, acetic anhydride is used to cap unreacted starting materials and 
pH 1 1 buffer is used to effect linker cleavage to expose a new aroino group for the next peptide 
bond formation reaction. Once the tetrapeptide is completed, those library members bearing 
carboxylate side chains can also be cyclized with their amino termini to form macrocyclic 
peptides, while linear peptide members can simply be N-acetylated (see Figure 36). 



58 of 88 



wo 02/074929 ^ ^ PCT/US02/08S46 

[00189] It will be appreciated that a virtually unlimited assortment of amino acid building 
blocks caii be incorporated into a non-idatural pejptide library. Unlike peptide libraries generated 
using the protein biosynthetic machinery such as phage displayed libraries (O'Neil et ah Cvrr. 
OpxTL Struct Biol 1995, 5, 443-9), mBNA displayed libraries (Roberts et al Proc. Natl AcacL 
Sci, USA 1997, 94, 12297-12302) ribosome displayed libraries (Roberts et al Curr. Opin. Chem. 
Biol 1999, J, 268-73; Schaffitzel et al J. Immunol MetJwds 1999, 257, 1 19-35), or intracellular 
peptide libraries (Norman et al Science 1999, 285^ 591-5), amino acids with non-proteinogenic 
side chains, non-natural side chain stereochemistry, or non-peptidic backbones can all be 
incorporated into this library. In addition, the many commercially available di-, tri- and 
oligopeptides can also be used as building blocks to generate longer library members. The 
presence of non-natural peptides in this library may confer enhanced pharmacological properties 
such as protease resistance compared with peptides generated ribosomally. Similarly, the 
macrocyclic library members may yield higher affinity ligands since the entropy loss upon 
binding their targets may be less than their more flexible linear coxmterparts. Based on the 
enormous variety of conomercially available amino acids fitting these descriptions, the maximum 
diversity of this non-natural cyclic and linear tetrapeptde library can exceed 100 x 100 x 100 x 
100 = 10* members. 

[00190] Another example of a library using the approach described above includes the DNA- 
templated synthesis of a diversity-oriented macrobicyclic library containing 5- and 14-membered 
rings (Figure 38). Starting material for this library consists of DNA templates atninoacylated 
with a variety of side-chain protected lysine derivatives and commercially available lysine 
analogs (including aminoethyl-cysteine, aminoethylserine, and 4-hydroxylysine). In the first 
step, DNA-templated amide bond formation with a variety of DNA-linked amino acids takes 
place as described in the non-natural peptide library, except that the vicinal diol linker 16 
described above is used instead of the traceless sulfone linker. Following amide bond formation, 
the diol linker is oxidatively cleaved with sodiimi periodate. Deprotection of the lysine 
analog' side chain amine is followed by DNA-templated anadde bond formation catalyzed by 
silver trifluoroacetate between the free amine and a library of acrylic derived thioesters. The 
resulting olefins are treated with a hydroxylamiae to fomi nitrones, which undergo 1,3-dipolar 
cycloaddition to yield the bicyclic library (Figure 38). DNA-linked reagents for this library are 
prepared by coupling lysine analogs to 5'-amino-terminated template primers (Figure 35), 
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coupling aminoacylated diol linkers to 3'-amino-terminated DNA oligonucleotides (Figure 38), 
and coupling acrylic acids to 3 '-thiol terminated DNA oligonucleotides (Figure 38). 
[00191] As but one example of a specific library generated firom the first general approach, 
described above, three iterated cycles of DNA-templated amide formation, traceless linker 
cleavage, and purification with streptavidin-Iinked beads were used to generate a non-natural 
tripeptide (Fig. 39). Each amino acid reagent was linked to a unique biotinylated 10-base DNA 
oligonucleotide through the sulfone linker described above. The 30-base amine-terminated 
template programmed to direct the tripeptide synthesis contained three consecutive, 10-base 
regions that were complementary to the three reagents, mimicking the strategy that woiild be 
used in a multi-step DNA-templated small molecule Ubrary synthesis. The first amino acid 
reagent (13) was combined with the template and activator 4-(4,6-dimethoxy-l,3,5-triazin-2-yl)- 
4-methylmorpholimum chloride (DMT-MM) (Kunishima et al Tetrahedron 2001, 57, 1551) to 
effect DNA-templated peptide bond formation. The desired product was purified by mixing the 
crude reaction with streptavidin-Iinked magnetic beads, washing with 4 M guanidinium chloride, 
and eluting with pH 11 bviffer to effect sxilfone linker cleavage, providing product 14. The fitee 
amine group in 14 was then elaborated in a second and third round of DNA-templated amide 
formation and linker cleavage to afford dipeptide IS and tripeptide 16 (Figure 39). 
[00192]' The progress of each reaction, purification, and sulfone linker cleavage step was 
followed by denaturing polyacrylamide gel electrophoresis. The final tripeptide linked to 
template (16) was digested with the restriction endonuclease EcdSl and the digestion fragment 
containing the tripeptide was characterized by MALDI mass spectrometry. Beginning with 2 
mnol (~ 20 ^g) of starting material, sufficient tripeptide product was generated to serve as the 
template for more than 10^ in vitro selections and PGR reactions (Kramer et al. in Current 
Protocols in Molecular Biology, Vol 3 (Ed.: F. M. Ausubel), Wiley, 1999, pp. 15.1) (assimiing 
1/10,000 molecules survive selection). No significant product was generated when the starting 
material template was capped with acetic anhydride, or when control reagents containing 
sequence mismatches were used instead of the complementary reagents (Fig. 39).* 
[00193] A non-peptidic multi-step DNA-templated small molecule synthesis (Fig. 40) that 
uses all three linker strategies developed above was also performed. An amine-tenninated 30- 
base template was subjected to DNA-templated amide bond formation using an aminoacyl donor 
reagent (17) containing the diol linker and a biotinylated 10-base oligonucleotide to afford amide 
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IS. The desired product was isolated by capturing the crude reaction on streptavidin beads 
foUoWed by cleaving the linker with Naib4 to generate aldehyde 19. The DNA-templated Wittig 
reaction of 19 with the biotinylated autocleaving phosphorane reagent 20 afforded fumaramide 
21. The products from the second DNA-templated reaction were partially purified by washing 
with streptavidin beads to remove reacted and unreacted reagent. In the third DNA-templated 
step, fumaramide 21 was subjected to a DNA-templated conjugate addition (Gartner et al. J. Am. 
Chem. Soa 2001, 723, 6961) using thiol reagent 22 linked through the sulfone linker to a 
biotinylated oligonucleotide. The desired conjugate addition product (23) was purified by 
immobilization with streptavidin beads. Linker cleavage with pH 11 buffer afforded final 
product 24 in 5-10% overall isolated yield for the three bond forming reactions, two linker 
cleavage steps, and three purifications (Figure 40). This final product was digested with jEcoRI 
and the mass of the small molecule-linked template firagment was confirmed by MALDI mass 
spectrometry (exact mass: 2568, observed mass: 2566±5). As in the tripeptide example, each of 
the three reagents used during this multi-step synthesis annealed at a unique location on the DNA 
template, and control reactioiis with sequence mismatches yielded no product (Fig. 40). As 
expected, control reactions in which the Wittig reagent was omitted (step 2) also did not generate 
product following the third step. Taken together, the DNA-templated syntheses of 16 and 24 
demonstrate the ability of DNA to direct the sequence-programmed multi-step synthesis of both 
oligomeric and non-oligomeric snaall molecules imrelated in structure to nucleic acids. 
[00194] The commercial availability of many substrates for DNA-templated reactions 
including amines, carboxylic acids, a-halo carbonyl compoimds, olefins, aUcoxyamines, 
aldehydes, and nitroallcanes may allow the translation of large libraries of DNA into diverse 
small molecule libraries. The direct one-pot selection of these libraries for members with desired 
binding or catalytic activities, followed by the PGR amplification and diversification of the DNA 
encoding active molecules, may enable synthetic small molecules to evolve in a manner 
paralleling the powerful methods Nature uses to generate new molecxilar fimction. In addition, 
multi-step nucleic acid-templated synthesis is a requirement of previously proposed models (A. 1. 
Scott, Tetrahedron Lett, 1997, 38, 4961; Li et al Nature 1994, 369, 218; Tamura et al. Proc. 
Natl Acad Sci USA 2001, 98, 1393) for the prebiotic translation of replicable information into 
functional molecules. These findings demonstrate that nucleic acid templates are mdeed capable 
of directing iterative or nonriterative multi-step small molecule synthesis even when reagents 
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anneal at widely varying distances from the growing molecule (in the above examples, zero to 
twenty baseis). As described in more detail below, libraries of synthetic liioiecuies can then be 
evolved towards active ligand and catalysts through cycles of translation, selection, amplification 
and mutagenesis. 

[00195] E) Evolving Plastics: In yet another embodiment of the present invention, a nucleic 
acid (e.g., DNA, RNA, derivative thereof) is attached to a polymerization catalyst. Since nucleic 
acids can fold into complex structures, the nucleic acid can be used to direct and/or affect the 
polymerization of a growing polymer chain-. For example, the nucleic acid may influence the 
selection of monomer units to be polymerized as well as how the polymerization reaction takes 
place (e.^., stereochemistry, tacticity, activity). The synthesized polymers may be selected for 
specific properties such molecular, weight, density, hydrophobicity^ tacticity, stereoselectivity, 
etc.^ and the nucleic acid which formed an integral part of the catalyst which directed its 
synthesis may be amplified and evolved (Figure 41). Iterated cycles of ligand diversification, 
selection, and amplification allow for the true evolution of catalysts and polymers towards 
desired properties. 

[00196] To give but one example, a library of DNA molecules is attached to Gmbbs' 
ruthenium-based ring opening metathesis polymerization (ROMP) catalyst through a 
dihydroimidazole ligand (Scholl et ah Org. Lett 1(6):953, 1999; incorporated herem by 
reference) creating a large, diverse pool of potential catalytic molecules, each unique by nature 
of the fimctionalized Ugand. Undpubtedly, fimctionalizing the catalyst witii a relatively large 
DNA-dehydroinoddazole (DNA-DHI) ligand will alter the activity of the catalyst. Each DNA 
molecule has the potential to fold into a unique stereoelectronic shape which potentially, has 
different selectivities and/or activities in the polymerization reaction (Figure 42). Therefore, the 
library of DNA Ugands can be "translated" into a library of plastics upon the addition of various 
monomers. In certain embodiments, DNA-DHI Ugands capable of covalently insertiag 
themselves into the growing polymer, thus creating a polymer tagged with the DNA that encoded 
its creation, are used. Using tiie synthetic scheme shown in Figure 42, DHI Ugands are produced 
containing two chemical handles, one used to attach the DNA to the Ugand, the other used to 
attach a pedant olefin to the DHI backbone. Rates of metathesis are known to vary widely based 
upon olefin substitution as well as the identity of the catalyst. Through alteration of these 
variable, the rate of pendant olefin incorporation can be modulated such that ^pendant oicfm metathesis 
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« ^OMP, thereby, allowing polymers of moderate to high molecular weights to be formed 
before insertioiS of the DNA Mg md ' corresponding polymer teimiiiation. Viiiiylic either aore 
commonly used in ROMP to functionalize the polymer termini (Gordon et al Chem. Biol 7:9- 
16, 2000; incorporated herein by reference), as well as produce polymers of decreased molecular 
weight 

[00197] Subseqxaent selection of a polymer from the library based on a desired property by 
electrophoresis, gel filtration, centrifugal sedimentation, partitioning into solvents of different 
hydrophobicities, etc. Amplification and diversification of the coding nucleic acid via 
techniques such as error-prone PGR or DNA shuffling followed by attachment to a DHI 
backbone will allow for production of another pool of potential ROMP catalysts enriched in the 
selected activity (Figure 43). This method provides a new approach to generating polymeric 
materials and the catalysts that create them. 

[00198] Example 6: Characterization of DNA-Templated Synthetic Small Molecule 
Libraries: T he non-natural peptide and bicyclic libraries described above are characterized in 
several stages. Each candidate reagent is conjugated to its decoding DNA oligonucleotide, tihien 
subjected to model reactions with matched and mismatched templates. The products from these 
reactions are analyzed by denaturing polyacrylamide gel electrophqresis to assess reaction 
efficiency, and by mass spectrometry to verify anticipated product structures. Once a complete 
set of robxist reagents are identified, the complete multi-step DNA-templated syntheses of 
representative single library members on a large scale is performed and the final products are 
characterized by mass spectrometry. 

[00199] More specifically, the sequence fidelity of each multi-step DNA-templated library 
synthesis is tested by following the fate of single chemically labeled reagents through the course 
of one-pot library synthesis reactions. For example, products arising from building blocks 
bearing a ketone group are captured with commercially available hydrazide-linlced resin and 
analyzed by DNA sequencing to verify sequence fideHty during DNA-templated synthesis. 
Similarly, when using non-biotinylated model templates, building blocks bearing biotin groups 
are purified after DNA-templated synthesis using streptavidin magnetic beads and subjected to 
DNA sequencing (Liu et al J, Am, Chem. Soc, 2001, 123, 6961-6963) Codons that show a 
greater propensity to anneal with nodsmatched DNA are identified by screening in this manner 
and removed from the genetic code of these synthetic libraries. 
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[00200] Example 7; In Vitro Selection of Protein Ligands jBrom Evolvable Synthetic 
Libraries: B ecause every library member generated in this approach is covalently linked to a 
DNA oligonucleotide that encodes and directs its synthesis, libraries can be subjected to true in 
vitro selections. Although direct selections for small molecule catalysts of bond-forming or 
bond-cleaving reactions are an exciting potential application of this approach, the simplest in 
vitro selection that can be used to evolve these libraries is a selection for binding to a target 
protein. An ideal initial target protein for the sjoithetic library selection both plays an important 
biological role and possesses known ligands of varying affinities for validating the selection 
methods. 

[00201] One receptor of special interest for use in the present invention is the.avPa receptor. 
The avp3 receptor is a member of the integrin family of transmembrane heterodimeric 
glycoprotein receptors (Miller et aL DrugDiscov Today 2000, 5, 397-408; Berman et al Membr 
Cell Biol 2000, 13, 207-44) The avPa integrin receptor is expressed on the surface of many cell 
types such as osteoclasts, vascular smooth muscle cells, endothelial cells, and some tumor cells. 
This receptor mediates several important biological processes including adhesion of osteoclasts 
to the bone matrix (van der Pluijm et al J, Bone Miner. Res. 1994, 9, 1021-8) smooth muscle 
cell migration (Choi et al J. Vase. Surg. 1994, 19, 125-34) and tumor-induced angiogenesis 
(Brooks et al Cell 1994, 79, 1157-64) (the outgrowth of new blood vessels). During tumor- 
induced angiogenesis, invasive endothelied cells bind to extracellular matrix components through 
their avps integrin receptors. Several studies (Brooks et al Cell 1994, 79, 1157-64; Brooks et 
al Cell 1998, 92, 391-400; Friedlander et al Science 1995, 270, 1500-2; Vamer et al CellAdhes 
Commun 1995, 5, 367-74; Brooks et al J. Clin Invest 1995, 96, 1815-22) have demonstrated that 
the inhibition of this integrin binding event with antibodies or small synthetic peptides induces 
apoptosis of the proliferative angiogenic vascular cells and can inhibit tumor metastasis. 
[00202] A number of peptide Ugands of varying affinities and selectivities for the avPa 
integrin receptor have been reported. Two benchmark avPs integrin antagonists are the linear 
peptide GRGDSPK (IC50 = 210 nM (Dechantsreiter et al J, Med. Chem. 1999, 42, 3033-40; 
Pfaff er al J, Biol Chem. 1994, 269, 20233-8) and the cyclic peptide cyclo-RGDfV (Pfaff e/^ al 
J, Biol Chem. 1994, 269, 20233-8)*(f = (D)-Phe, IC50 = 10 nM). While peptides antagonists for 
integrins commonly contain ROD, not all RGD-containing peptides are high affinity integrin 
ligands. Rather, the conformational context of RGD and other peptide sequences can have a 
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profound eflFect on integrin affinity and specificity (Wennuth et al J. Am. Chem, Soc. 1997, iiP, 
1328-1335; Geyer etal J. Am. Chem. Soc. 1994, 116, 7735-7743; Rai etal Bioorg. Med Chem. 
Lett 2001, 11, 1797-800; Rai et al Cun\ Med Chem. 2001, S, 101-19) For this reason, 
combinatorial approaches towards avPa integrin receptor antagonist discovery are especially 
promising. 

[00203] The biologically important and medicinally relevant role of the avPa integrin receptor 
together with its known peptide antagonists and its commercial availability (Chemicon 
International, Inc., Temecula, CA) make the aypa integrin receptor an ideal initial target for 
DNA-templated synthetic small molecule libraries. The aypa integrin receptor can be 
immobilized by adsorption onto microtiter plate wells without impairing its ligand binding 
ability or specificity (Dechantsreiter et al J. Med. Chem. 1999, 42, 3033-40; Wermuth et al J. 
Am. Chem. Soc. 1997, 119, 1328-1335; Kmhntxetal J. Am. Chem. Soc. 1996, 118, 7461-7472). 
Alternatively, the receptor can be immobilized by conjugation with NHS ester or maleimide 
groups covalently linked to sepharose beads and the abiUty of the resulting integrin aflSnity resin 
to maintain known ligand binding properties can be verified. 

[00204] To perform the actual protein binding selections, DNA template-linked synthetic 
peptide or macrocyclic libraries are dissolved in aqueous binding buffer in one pot and 
equilibrated in the presence of inmiobilized avPa integrin receptor. Non-binders are washed 
away with buffer. Those molecules that may be binding through their attached DNA templates 
rather than through their synthetic moieties are eliminated by washing the bound library with 
unfimctionalized DNA templates lacking PGR primer binding sites. Remaining ligands bound to 
the immobilized avPa integrin receptor are eluted by denaturation or by the addition of excess 
high affinity RGD-containing peptide ligand. The DNA templates that encode and direct the 
syntheses of avPa integrin binders are amplified by PGR using one primer designed to bind to a 
constant 3' region of the template and one pool of biotinylated primers fimctionalized at its 5' 
end with the library starting materials (Fig. 44). Purification of the biotinylated strand completes 
one cycle of synthetic molecule trajoslation, selection, and amplification, yielding a sub- 
population of DNA templates enriched in sequences that encode synthetic avp3 integrin ligands. 
[00205] For reasons similar to those that make the avPs integrin receptor an attractive initial 
target for the approach to generating synthetic molecules with desired properties, the factor Xa 
serine protease also serves as a promising protein target. Blood coagulation involves a complex 
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cascade of enzyme-catalyzed reactions that ultimately generate fibrin, the basis of blood clots 
(Rai et al Curr. Med Chem. 2001, 5, 101-109; Vacca et al Cutk Opin. Chem BioL 2000, 4, 
394-400) Thrombin is the serine protease that converts fibrinogen into fibrin during blood 
clotting. Thrombin, in turn, is generated by the proteolytic action of factor Xa on prothrombin. 
Because thromboembolitic (blood clotting) diseases such as stroke remain a leading cause of 
death in the world (Vacca et al Cum Opin Chem. BioL 2000, 4, 394-400) the development of 
dmgs that inhibit thrombin or factor Xa is a major area of pharmaceutical research. The 
inhibition of factor Xa is a newer approach thought to avoid the side effects associated with 
inhibiting thrombin, which is also involved in normal hemostasis (Maignan et al J. Med Chem, 
2000, 43, 3226-32; Leadley et al J. Cardiovasc. Pharmacol 1999, 34, 791-9; Becker et al 
Bioorg. Med Chem. Lett. 1999, 9, 2753-8; Choi-Sledeski et al Bioorg, Med, Chem. Lett 1999, 
9, 2539-44; Choi-Sledeski et al J. Med Chem, 1999, 42, 3572-87; Ewing et al J. Med Chem, 
1999, 42, 3557-71; Bostwick et al Thromb Haemost 1999, 81, Although many agents 

including heparin, hirudin, and hirulog have been developed to control the production of 
thrombin, these agents generally have the disadvantage of requiring intravenous or subcutaneous 
injection several a day in addition to possible side effects, and the search for synthetic 
small molecule factor Xa inhibitors remains the subject of great research effort. 
[00206] Among factor Xa inhibitors with known binding affinities are a series of tripeptides 
ending with arginine aldehyde (Marlowe et al Bioorg, Med Chein, Lett 2000, 70, 13-16) that 
are easily be included in the DNA-templated non-natural peptide library described above. 
Depending on the identities of the first two residues, these tripeptides exhibit IC50 values ranging 
from 15 nM to 60 ^lM (Marlowe et al Bioorg. Med. Chem. Lett 2000, 10, 13-16) and therefore 
provide ideal positive controls for validating and calibrating an in vitro selection for synthetic 
factor Xa Ugands (see below). Both factor Xa and active factor Xa immobilized on resin are 
commercially available (Protein Engineering Technologies, Denmark). The resiu-bound factor 
Xa is used to select members of both the DNA-templated non-natural peptide and bicyclic 
libraries with factor Xa affinity in a manner analogous to the integrin receptor binding selections 
described above. 

[00207] Following PGR amplification of DNA templates encoding selected synthetic 
molecules, additional rounds of translation, selection, and amplification are conducted to enrich 
the library for the highest affinity binders. The stringency of the selection is gradually increased 
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by increasing the salt concentration of the binding and washing buffers, decreasing the duration 
of binding, elevating the binding and washing temperatures, and ina-easing the concentration of 
washing additives such as template DNA or unrelated proteins. Importantly, in vitro selections 
can also select for specificity in addition to binding affinity. To eliminate those molecules that 
possess imdesired binding properties, library members bound to immobilized avPa integrin or 
factor Xa are w^ashed with non-target proteins such as other integrins or other serine proteases, 
leaving only those molecules that bind the target protein but do not bind non-target proteins. 
[00208] Iterated cycles of translation, selection, and amplification results in library enrichment 
rather than library evolution, which requires diversification between rounds of selection. 
Diversification of these synthetic libraries are achieved in at least two ways, both analogous to 
methods used by Nature to diversify proteins. Random point mutagenesis is performed by 
conducting the PGR amplification step imder error-prone PGR (Galdwell et al PCR Methods 
Applic. 1992, 2, 28-33) conditions. Because the genetic code of these molecules are written to 
assign related codons to related chemical groups, similar to the way that the natural protein 
genetic code is constructed, random point mutations in the- templates encoding selected 
molecules will diversify progeny towards chemically related analogs. In addition to point 
mutagenesis, synthetic libraries generated m this approach are also diversified tising 
recombination. Templates to be recombined have the stmcture shown in Fig. 45, in which 
codons are separated by five-base non-palindromic restriction endonuclease cleavage sites such 
as those cleaved by Avail (G/GWCC, W=A or T), 5^^961 (G/GNCG, N=A, G, T, or C), Ddel 
(C/TNAG), or HinVl (G/ANTC). Following selections, templates encoding desired molecules 
are enzymatically digested with these commercially available restriction enzymes. The digested 
fragments are then recombined into intact templates with T4 DNA ligase. Because the 
restriction sites separating codons are nonpalindromic, templates fragments can only reassemble 
to form intact recombined templates (Fig. 45). DNA-templated translation of recombined 
templates provides recombined small molecules. In this way, functional groups between 
synthetic small molecules with desired activities are recombined in a maimer analogous to the 
recombination of amino acid residues between proteins in Nature. It is well appreciated that 
recombination explores the sequence space of a molecule much more eflBciently than point 
mutagenesis alone (MinshuU et al Curr. Opin. Chem, Biol 1999, 5, 284-90; Bogarad et al Proc. 
Natl Acad. Sci. USA 1999, 96, 2591-5; Stenuner, W. Nature 1994, 570, 389-391). 
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[00209] Small molecule evolution using mutation and recombination oflfers two potential 
advantages over simple enrichment. If the total diversity of the library is much less than the 
number of molecules made (typically 10^^ to 10^^, every possible library member is present at 
the start of the selection. In this case, diversification is still usefid because selection conditions 
almost alv^ays change as rounds of evolution progress. For example, later rounds of selection 
will likely be conducted under higher stringencies, and may involve counter selections against 
binding non-target proteins. Diversification gives library members that have been discarded 
during earlier rounds of selection the chance to reappear in later rounds imder altered selection 
conditions in which their fitness relative to other members may be greater. In addition, it is quite 
possible using this approach to generate a sjoithetic library that has a theoretical diversity greater 
than 10*^ molecxiles. In this case, diversification allows molecules that never existed in the 
original Ubrary to emerge in later rounds of selections on the basis of their similarity to selected 
molecules, similar to the way in- which protein evolution searches the vastness of protein 
sequence space one small subset at a time. 

[00210J Example 8: Characterization of Evolved Compounds: F ollowing multiple rounds 
of selection, amplification, diversification, and translation, molecules surviving the selection will 
be characterized for their ability to bind the target proteiiL To identify the DNA sequences 
encoding evolved synthetic molecules surviving the selection, PCR-amplified templates are 
cloned into vectors, transformed into cells, and sequenced as individual clones. DNA 
sequencing of these subcloned templates reveal the identity of the synthetic molecules surviving 
the selection. To gain general information about the functional groups being selected during 
roimds of evolution, populations of templates are sequenced in pools to reveal the distribution of 
A, G, T, and C at every codon position. The judicious design of each fimctional group's genetic 
code allows considerable information to be gathered from population sequencing.' For example, 
a G at tiie first position of a codon may designate a charged group, while a C at this position may 
encode a hydrophobic substituent. 

[00211] To validate the integrin binding selection and to compare selected Ubrary members 
with known avPa integrin Ugands, linear GRGDSPK and a cyclic RGDfV analog (cyclic iso- 
ERGDfV) are also included iit-the DNA-templated cyclic peptide library. The selection 
conditions are adjusted until verification that libraries containing these known integrin ligands 
undergo enrichment of the DNA templates encoding the known ligands upon selection for 
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integrin binding. In addition, the degree of eiarichment of template sequences encoding these 
known avPs integrin ligands is correlated with their known aJBHnities and with the enrichment 
and affini ty of newly discovered avpa integrin ligands. 

[00212] Once the enrichment of tmiplate sequences encoding known and new integrin ligands 
is confirmed, novel evolved ligands will be synthesized by non-DNA templated synthesis and 
assayed for their avPa integrin receptor antagonist activity and specificity. Standard in vitro 
binding assays to integrin receptors (Dechantsreiter'e/ al J. Med Chem. 1999, 42, 3033-40) are 
performed by competing the binding of biotinylated fibrinogen (a natural integrin ligand) to 
immobilized integrin receptor with tibe ligand to be assayed. The inhibition of binding to 
fibrinogen is quantitated by incubation with an alkaline phosphatase-conjugated anti-biotin 
antibody and a chromogenic alkaline phosphate substrate. Comparison of the binding affinities 
of randomly chosen library members before and after selection will validate the evolution of the 
library towards target binding. Assays for binding non-target proteins reveal the ability of these 
libraries to be evolved towards binding specificity in addition to binding affinity. 
[00213] Similarly, the selection for factor Xa binding is validated by including the known 
factor Xa tripeptide inhibitors in the library design and verifying that a round of factor Xa 
binding selection and PGR amplification results in the enrichment of their associated DNA 
templates. Synthetic Ubrary members evolved to bind factor Xa are assayed in vitro for their 
ability to inhibit factor Xa activity. Factor Xa inhibition can be readily assayed 
spectrophotometrically using the commercially available chromogenic substrate S-2765 
(Chromogenix, Italy). 

[00214] While the DNA sequence alone of a non-natural peptide library member is likely to 
■ reveal the exact identity of the corresponding peptide, the final step in the bicyclic library 
synthesis is a non-DNA-templated intramolecular 1,3-dipolar cycloaddition that may yield 
diastereomeric pairs of regioisomers. Although modeling strongly suggests that only the 
regioisomer shown in Fig. 38 can form for steric reasons, facial selectivity is less certain. 
Diastereomeric purity is not a requirement for the in vitro selections described above since each 
molecule is selected on a single molecule basis. Nevertheless, it may be usefiil to characterize 
the diastereoselectivity of the dipolar cycloaddition. To accompUsh this, non-DNA-templated 
synthesis of selected bicyclic library members is performed, diastereomers are separated by 
chiral preparative HPLC, and product stereochemistry by nOe or X-ray diffraction is detemiined. 
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[00215] Example 9; Translating DNA into Non-Natural Polymers Using DNA 
Polymerases: An alternative approach to translating DNA into non-natural, evolvable polymers 
takes advantage of the ability of some DNA polymerases to accept certain modified nucleotide 
triphosphate substrates (D. M. Perrin et ah J. Am. Chem. Soa 2001, 725, 1556; D, M. Perrin et 
al Nucleosides Nucleotides 1999, 18, 377-91; T. Gourlain et al Nucleic Acids Res. 2001, 29, 
1898-1905; S. E- Lee et al Nucleic Acids Res, 2001, 29, 1565-73; K. Sakthievel et al Angew. 
Chem, Int. Ed. 1998, 57, 2872-2875). Several deoxyribonucleotides (Fig. 45) and ribonucleotides 
bearing modifications to groups that do not participate in Watson-Crick hydrogen bonding are 
known to be inserted with high sequence fidelity opposite natural DNA templates. Importantly, 
single-stranded DNA containing modified nucleotides can serve as efBcient templates for the 
DNA-polymerase-catalyzed incorporation of natural or modified mononucleotides. In one of the 
earliest examples of modified nucleotide incorporation by DNA polymerase, Toole and co- 
workers reported the acceptance of 5-(l-pentynyl)-deoxyuridine 1 by Vent DNA polymerase 
vmder PGR conditions (J. A. Lalham et al Nucleic Acids Res. 1994, 22, 2817-22). Several 
additional 5-functionalized deoxyuridines (2-7) derivatives were subsequently found to be 
accepted by thermostable DNA polymerases suitable for PGR (K. Sakthievel et al Angew. 
Chem. Int. Ed. 1998, 57, 2872-2875). The first fimctionalized purine accepted by DNA 
polymerase, deoxyadenosine analog 8, was incorporated into DNA by T7 DNA polymerase 
together with deoxyuridine analog 7 (D. M. Perrin et al Nucleosides Nucleotides 19^9, 18, 377- 
91). DNA libraries containing both 7 and 8 were successfully selected for metal-independent 
RNA cleaving activity (D. M. Perrin et al J. Am, Chem. Soc. 2001, 123, 1 556-63). Williams and 
co-workers recently tested several deoxyuridine derivatives for acceptance by Taq DNA 
polymerases and concluded that acceptance is greatest when using C5-modified \iridines bearing 
rigid alkyne or /ra;w-alkene groups such as 9 and 10 (S. E. Lee et al Nucleic Acids Res. 2001, 
29, 1565-73). A similar study (T. Gourlain et al Nucleic Acids Res. 2001, 29, 1898-1905) on 
G7-fijnctionalized 7-deaza-deoxyadenosines revealed acceptance by Taq DNA polymerase of 7- 
aminopropyl- (11), cz5-7-aminopropenyl- (12), and 7-aminopropynyl-7-dea2adeoxyadenosine 
(13). 

[00216] The fimctionalized nucleotides incorporated by DNA polymerases to date, shown in 
Fig. 46, have focused on adding "protein-like" acidic and basic fimctionahty to DNA. While 
equipping nucleic acids with general acid and general base functionality such as primary amine 
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and carboxylate groups may increase the capability of nucleic acid catalysts, the functional 
groups present in natural nucleic acid bases already have demonstrated the ability to serve as 
general acids and bases. The hepatitis delta ribozyme, for example, is thought to use the pK^r 
modulated endocyclic amine of cytosine 75 as a general acid (S. Nakano et al Science 2000, 
287, 1493-7) and the peptidyl transferase activity of the ribosome may similarly rely on general 
base or general acid catalysis (G. W. Muth et al Science 2000, 289, 947-50; P. Nissen et al 
Science 2000, 289, 920-930; N. Ban et al Science 2000, 289, 905-920) although the latter case 
remains the subject of ongoing debate (N. Polacek et al Natia^e 2001, 411, 498-501). Equipping 
DNA bases with additional Br0nsted acidic and basic groups, therefore, may not profoundly 
expand the scope of DNA catalysis. 

[00217] In contrast with simple general acid and general base functionality, chiral metal 

centers would expand considerably the chemical scope of nucleic acids. Functionality aimed at 
» 

binding chemically potent metal centers has yet to been incorporated into nucleic acid polymers. 
Natural DNA has demonstrated the ability to fold in complex three-dimensional structures 
capable of stereospecifically binding target molecules (C. H. Lin et al Chem, Biol 1997, ^,817- 
32; C. H. Lin et al Chem. Biol 1998, 5, 555-72; P. Schultze et al J. Mol Biol 1994, 255, 1532- 
47) or catalyzing phosphodiester bond manipiilation (S. W. Santoro et al Proc. Natl Acad. Scl 
USA 1997, 94, 4262-6; R. R. Breaker et al Chem. Biol 1995, 2, 655-60; Y. Li et al 
Biochemistry 2000, 39, 3106-14; Y. Li et al Proc. Natl Acad Scl USA 1999, P5, 2746-51). 
DNA depxirination (T. L. Sheppard et al Proa Natl Acad. Scl USA 2000; 97, l%Q^-nmT) and 
porphyrin metallation (Y. Li et al Biochemistry 1997, 36, 5589-99; Y. Li et al Nat. Struct. Biol 
1996, 3, lA^-l). Non-natural nucleic acids augmented with the ability to bind chemically potent, 
water-compatible metals such Cu, La, Ni, Pd, Rh, Ru, or Sc may possess greatly expanded 
catalytic properties. For example, a Pd-binding oligonucleotide folded into a well-defined 
structure may possess the ability to catalyze Pd-mediated coupling reactions with a high degree 
of regiospecificity or stereospecificity. Similarly, non-natural nucleic acids that form chiral Sc 
binding sites may serve as enantioselective cycloaddition or aldol addition catalysts. The ability 
of DNA polymerases to translate DNA sequences into .these non-natural polymers coupled with 
in vitro selections for catalytic activities would therefore enable the direct evolution of desired 
catalysts firom random libraries. 
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[00218] Evolving catalysts in this approach addresses the difficxilty of rationally designing 
catai)^^ 

approaches (K. W. Kuntz et ah Curr. Opim Ckem. Biol 1999, 5, 313-319; M. B. Francis et al 
Curr. OpirL Chem, Biol 1998, 2, 422-8) to organometallic catalyst discov^. For example, 
Hoveyda and co-workers identified Ti-based enantioselective epoxidation catalysts by serial 
screening of peptide ligands (K. D. Shimizu et al Angew. Chem. Int. Ed. 1997, 56) Serial 
screening was also used by Jacobsen and co-workers to identify peptide ligands that form 
enantioselective epoxidation catalysts when complexed with metal cations (M. B. Francis et al 
Angew, Chem, Int Ed. Engl 1999, 38, 937-941) Recently, a peptide Ubrary containing 
phosphine side chains was screened for the ability to catalyze malonate ester addition to 
cyclopentenyl acetate in the presence of Pd (S. R. Gilbertson et al J. Am. Chem. Sac. 2000, 122, 
6522-6523). The current approach dijBfers fundamentally from previous combinatorial catalyst 
discovery efforts, however, in that it enables catalysts with desired properties to spontaneously 
emerge from one pot, solution-phase libraries after evolutionary cycles of diversification, 
amplification, translation, and selection. This strategy allows up to 10^^ different catalysts to be 
generated and selected for desired properties in a single experiment. The compatibility of our 
approach with one-pot in vitro selections allows the direct selection for reaction catalysis rather 
than screening for a phenomenon associated with catalysis such as metal binding or heat 
generation. In addition, properties difiBcult to screen rapidly such as substrate stereospecificity 
or metal selectivity can be directly selected using our approach (see below). 
[00219] Key intermediates for a number of C5-functionalized uridLue analogs and C7- 
functionalized 7-deazaadenosine analogs have been synthesized for incorporation into non- 
natural DNA polymers. In addition, the synthesis of six C8-functionalized adenosine an^ogs as 
deoxyribonucleotide triphosphates has been completed. Because only limited information exists 
on the ability of DNA polymerases to accept modified nucleotides, we chose to synthesize 
analogs were synthesized that not only will bring metal-binding functionality to nucleic acids but 
that also will provide insights into the determinants of DNA polymerase acceptance. 
[00220] The strategy for the synthesis of metal-binding uridine and 7-deazaadenosine analogs 
is shown in Fig. 47. Both routes end with amide bond formation between NHS esters of metal- 
binding functional groups and amino modified deoxyribonucleotide triphosphates (7 and 13). 
Analogs 7 and 13 as well as acetylated derivatives of 7 have been previously shown (D. M. 
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Nucleic Acids Res. 2001, 29, 1898-1905; S. E. Lee etal Nucleic Acids Res. 2001, 29, 1565-73; 
K. Sakthivel et al. Angew. Chem. Int. Ed Engl. 1998, 57, 2872-2875) to be tolerated by DNA 
polymerases, including thennostable DNA polymerases suitable for PGR. This convergent 
approach allows a wide variety of metal-bindrag ligands to be rapidly incorporated into either 
nucleotide analog. The synthesis of 7 has been completed following a previously reported (K. 
Sakthivel et al Angew. Cheni. Int. Ed. Engl 1998, 57, 2872-2875) route (Fig. 48, Phillips, 
Chorba^ Liu, vmpublished results). Heck coupling of commercially available 5-iodo-2'- 
deoxyuridine (22) with N-allyltrifluoroacetamide provided 23. The 5 '-triphosphate group was 
installed by treatment of 23 with trimethylphosphate, POCI3, and proton sponge (1,8- 
bis(dimethylamino)-naphthalene) followed by tri-n-butylammonium pyrophosphate, and the 
trifluoroacetamide group then removed with aqueous ammonia to afford 7. 
[00221] Several steps towards the synthesis of 13 have been completed, the key intermediate 
for 7-deazaadenosine analogs (Fig. 49). Following a known route (J. DavolL J, Am. Chem, Soc. 
1960, 52, 131-138) diethoxyethylcyanoacetate (24) was synthesized from bromoacetal 25 and 
ethyl cyanoacetate (26). Condensation of 24 with thiourea provided pyrimidine 27, which was 
desulfiirized with Raney nickel and then cyclized to pyrrolopyrimidine 28 with dilute aqueous 
HCL Treatment of28 with POCI3 afforded 4-chloro-7-deazaadenine (29). The aiyl iodide group 
which will serve as a Sonogashira coupling partner for installation of the propargylic amine in 13 
was installed by reacting 29 with N-iodosuccinimide to generate 4-chIoro-7-iodo-7-deazaadenine 
(30) in 13% overall yield from bromoacetal 25. 

[00222] As alternative ftmctionaUzed adenine analogs that will both probe the structural 
requirements of DNA polymerase acceptance and provide potential metal-binding functionality, 
six 8-modified deoxyadenosine triphosphates (Fig. 50) have been synthesized. All functional 
groups were installed by addition to 8-bromo-deoxyadenosine (31), which was prepared by 
bromination of deoxyadenosine in the presence of ScCls, which we foimd to greatly increase 
product yield. Methyl- (32), ethyl- (33), and vinyladenosine (34) were synthesized by Pd- 
mediated Stille coupling of the corresponding alkyl tin reagent and 31 (P. Mamos et al 
Tetrahedron Lett. 1992, J5, 2413-2416). Methylamino- (35) (E. Nandanan et al J. Med. Chem. 
1999, 42, 1625-1638), ethylamino- (36), and histaminoadenosine (37) were prepared by 
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treatment of 23 with the corresponding amine in water or ethanol. The 5'-nucleotide 
ffipBoqpirates^f32-^ 

[00223] The ability of thermostable DNA polymerases suitable for PGR amplification to 
accept these modified nucleotide triphosphates containing metal-binding functionality. Non- 
natural nucleotide triphosphates were purified by ion exchange HPLC and added to PGR 
reactions containing Taq DNA polymerase, three natural deoxynucleotide triphosphates, pUG19 
template DNA, and two DNA primers. Primers were chosen to generate PGR products ranging 
from 50 to 200 base pairs in length. Control PGR reactions contained the foiir natural 
deoxynucleotide triphosphates and no non-natural nucleotides. PGR reactions were analyzed by 
agarose or denaturing acrylamide gel electrophoresis. Amino modified uridine derivative 7 was 
efficiently incorporated by Taq DNA polymerase over 30 PGR cycles, while the triphosphate of 
23 was not an efficient polymerase substrate (Fig. 51). Previous findings on tihie acceptance of 7 
by Taq DNA polymerase are in conflict, with both non-acceptance (K. Sakfhivel et al Angew. 
Chem. Int Ed Engl 1998, 37, 2872-2875) and acceptance (S. E. Lee et al Nucleic Acids Res. 
2001, 29, 1565-73) reported. 

[00224] Non-Natural Metal-Binding Deoxyribonucleotide Triphosphate Synthesis: The 
sjmtheses of the C5-functionaUzed uridine, C7-fimctionalized 7-deazaadenosine, and C8- 
fimctionalized adenosme deo?cynucleotide triphosphates will be completed- Synthesis of the 7- 
deazaadenosine derivatives from 4-chloro-7-iodo-deazaadenine (30) proceeds by glycosylation 
of 30 with protected deoxyribosyl chloride 38 followed by ammonolysis to afford 7-iodo- 
adenosine (39) (Fig. 31) (Gourlain et al Nucleic Acids Res. 2001, 29, 1898-1905), Protected 
deoxyribosyl chloride 38 can be generated fi-om deoxyribose as shown in Fig. 52. Pd-mediated 
Sonogashira coupling (Seela et al Helv. Chem. Acta 1999, 82, 1878-1898) of 39 with N- 
propynyltrifluoroacetamide provides 40, which is then be converted to the 5' nucleotide 
triphosphate and deprotected with ammonia as described above to yield 13. 
[00225] To generate rapidly a collection of metal-binding uridine and adenosine analogs, a 
variety of metal-binding groups as NHS esters vAW be coupled to C5-modified uridine 
intermediate 7 (already synthesized) and C7-modified 7-deazaadenosine intermediate 13. Metal- 
binding groups that will be examined initially are shown in Fig. 47 and include phosphines, 
fliiopyridyl groups, and hemi-salen moieties. If our initial polymerase acceptance assays (see the 
following section) of triphosphates of 8-modified adenosines 32-37 (Fig. 50) suggest that a 
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variety of S-modijBed adenosine analogs are accepted by thermostable polymerases, alkyl- and 
vmyTiiifluo^^ 

triphosphates such as 41 and 42 (Fig. 53). These intermediates are then coupled witii the NHS 
esters shown in Fig. 46 to generate a variety of metal-binding S-fiuactionalized deoxyadenosine 
triphosphates. 

[00226] Evaluating Non-Natural Nucleotides: Each functionalized deoxyribonucleotide 
triphosphate is then assayed for its suitability as a building block of an evolvable non-natural 
polymer library in two stages. First, simple acceptance by thermostable DNA polymerases is 
measured by PGR amplification of fragments of DNA plasmid pUC19 of varying length. PGR 
reactions contain sjnathetic primers designed to bind at the ends of tiie fragment, a small quantity 
of pUG19 template DNA, a thermostable DNA polymerase {Taq^ Pfu or Vent), three natural 
deoxyribonucleotide triphosphates, and the non-natural nucleotide triphosphate to be tested. The 
completely successful incorporation of the non-natural nucleotide results in the production of 
DNA products of any length at a rate similar to that of the control reaction. Those nucleotides 
that allow at least incorporation of 10 or more non-natural nucleotides in a single product 
molecule with at least modest eflBciency are subjected to the second stage of evaluation. 
[00227] Non-natural nucleotides accepted by thermostable DNA polymerases are evaluated 
for their possible mutagenic properties. If DNA polymerases insert a non-natural nucleotide 
opposite an incorrect (non-Watson-Crick) template base, or insert an incorrect natural nucleotide 
opposite a non-natural nucleotide in the template, the fidelity of library amplification and 
translation is compromised. To evaluate this possibility, PGR products generated in the above 
assay are subjected to DNA sequencing using each of the PGR primers. Deviations from the 
sequence of the pUG19 template imply that one or both of the mutagenic mechanisms are taking 
place. Error rates of less than 0.7% per base per 30 PGR cycles are acceptable, as error-prone 
PGR generates errors at approximately this rate (Galdwell et al PCX Methods Applic. 1992, 2, 
28-33) yet has been successftiUy used to evolve nucleic acid libraries. 

[00228] Pairs of promising non-natural adenosine analogs and non-natural uridine analogs are 
also tested together for their ability to support DNA polymerization in a PGR reaction containing 
both modified nucleotide triphosphates together with dOTP and dGTP. Successftil PGR product 
formation with two non-natural nucleotide triphosphates enables the incorporation of two non- 
natural metal-binding bases into the same polymer molecule. Functionalized nucleotides that are 
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especially interesting yet are not compatible with Taq, Pfu, or Vent thennostable DNA 
polymerases can stiflTelised in libranes~p^ are accepted By 

available DNA polymerase such as the Klenow jfragment of jB. coli DNA polymerase I, T7 DNA 
polymerase, T4 DNA polymerase, or M-MuLV reverse transcriptase. In this case, the assays 
require conducting the primer extension step of the PGR reaction at 25"37°C, and firesh 
polymerase must be added at every cycle following the 94°C denaturation step. DNA 
sequencing to evaluate the possible mixtagenic properties of the non-natural nucleotide is still 
performed as described above 

[00229] Generating Libraries of Metal-Binding Polymers: Based on the results of the above 
non-natural nucleotide assays, several libraries of —10^^ different nucleic acid sequences will be 
made containing one or two of the most polymerase compatible and chemically promising non- 
natural metal-binding nucleotides. Libraries are generated by PGR amplification of a synthetic 
DNA template library consisting of a random region of 20 or 40 nucleotides flanked by two 15- 
base constant priming regions (Fig. 54). The priming regions contain restriction endonuclease 
cleavage sites to allow cloning into vectors for DNA sequencing of pools or individual library 
members. One primer contains a chemical handle such as a primary amine group or a thiol 
group at its 5' terminus and becomes the coding strand of tihte library. The other primer contains 
a biotinylated T at its 5' terminus and becomes the non-coding strand. The PGR reaction 
includes one or two non-natural metal-binding deoxyribonucleotide triphosphates, three or two 
natural deoxyribonucleotide triphosphates, and a DNA polymerase compatible with the non- 
natural nucleotide(s). Following PGR reaction to generate the double-stranded form of the 
library and gel purification to remove unused primers, library member duplexes are denatured 
cheirdcally. The non-coding strands are the removed by several washings with streptavidin- 
linked magnetic beads to ensure that no biotinylated strands remain in the library. Libraries of 
up to 10^^ different members are generated by this method, far exceeding the combined diversity 
of previous combinatorial catalyst efforts. 

[00230] Each library is then incubated in aqueous solution with a metal of iuterest fi:om the 
following non-limiting list of water compatible metal salts (Fringueli et al. Eur. J. Org, Chem. 
2001, 2001, 439-455; Zaitoun et al J, Phys. Chem. B 1997, 1857-1860): ScCls, CrCb, MnCt, 
FeCl2, FeCls, C0CI2, NiCl2, CuCh, ZnCk, GaCb, YCI3, RuCla, RhCla, PdCb, AgCl, CdCla, 
InCls, SnCl2, La(OTf)3, Ce(OTJ03, Pr(OTf)3, Nd(0Tf)3, Sm(OTf)3, Eu(0Tf)3, Gd(OTf)3, 
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Tb(OTf)3, Dy(OTf)3, Ho(OTf)3, . Er(OTf)3, Tm(OTQ3, Yb(0Tf)3, Lu(0Tf)3, IrCb, PtClz, 

to be catalyzed. For example, libiaries aimed at reactions such as aldol condensations or hetero 
Diels-Alder reactions that are known (Fringuelli et al Eur. J. Org. CItem. 2001, 2001, 439-455) 
to be catalyzed by Lewis acids are incubated with ScCls or with one of tihe lanthanide triflates, 
while those aimed at coupling electron-deficient olefins with aryl halides are incubated with 
PdCl2. The metalated library is then purified away firom unbound metal salts by gel filtration 
using sephadex or acrylamide cartridges, which separate DNA oligonucleotides 25 bases or 
longer firom unbound small molecule components. 

[00231] The ability of the polymer library (or of individual library members) to bind metals of 
interest is verified by treating the metalated library firee of unboimd metals with metal staining 
reagents such as dithiooxamide, dimethylglyoxime, KSCN (Francis et al Curn Opin, Chem, 
Biol 1998, 2, 422-8) or EDTA (Zaitoun et al X Phys. Chem, B 1997, 101, 1857-1860) that 
become distinctly colored in the presence of different metals. The approximate level of metal 
binding is measured by spectrophotometric comparison with solutioiis of free metals of known 
concentration and with solutions of positive control oligonucleotides containing an EDTA group 
(which can be introduced using a commercially available phosphoramidite firom Glen Research). 
[00232] In Vitro Selections for Non^-Natural Polymer Catalysts: Metalated libraries of 
evolvable non-natural polymers containing metal-binding groups are then subjected to one-pot, 
solution-phase selections for catalytic activities of interest. Library members that catalyze 
virtually any reaction that causes bond formation between two substrate molecules or that results 
in bond breakage into two product molecules are selected using the schemes proposed in Figs. 54 
and 55. To select for bond forming catalysts (for example, hetero Diels-AIder, Heck coupling, 
aldol reaction, or olefin metathesis catalysts), library members are covalently linked to one 
substrate through their 5' amino or thiol termini. The other substrate of the reaction is 
synthesized as a derivative linked to biotin. When dilute solutions of library-substrate conjugate 
are reacted with the substrate-biotin conjugate, those library members that catalyze bond 
formation cause the biotin group to become covalently attached to themselves. Active bond 
forming catalysts can then be separated firom inactive library members by capturing the former 
with inamobilized streptavidin and washing away inactive polymers (Fig. 55). 
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[00233] In an analogous maimer, library members that catalyze bond cleavage reactions such 
as^Teta>aid^ 6r~olefS aiKj^ 

followed by periodate cleavage can also be selected. In tiiis case, metalated library members are 
covalently linked to biotinylated substrates such that the bond breakage reaction causes the 
disconnection of the biotin moiety from the library members (Fig. 56). Upon incubation under 
reaction conditions, active catalysts, h\xt not inactive library members, induce the loss of their 
biotin groups. Streptavidin-linked beads can then be used to capture inactive polymers, while 
active catalysts are able to elute from the beads. Related bond formation and bond cleavage 
selections have been used successfully in catalytic RNA and DNA evolution (Jaschke et al Curt\ 
Opin, Chem, Biol 2000, 4^ 257-62) Although these selections do not explicitly select for 
multiple turnover catalysis, RNAs and DNAs selected in this manner have in general proven to 
be multiple turnover catalysts when separated from their substrate moieties (Jaschke et al Curr. 
Opin. Chem. Biol 2000, 4, 257-62; Jaeger et al Proc, Natl Acad. Scl USA 1999, 96, 14712-7; 
Bartel et al Science, 1993, 261, 141 1-8; Sen et al Curr. Opin. Chem. Biol 1998, 2, 680-7). 
[00234] Catalysts of three important and diverse bond-forming reactions will initially be 
evolved; Heck coupling, hetero Diels-Alder cycloaddition, and aldol addition. All three 
reactions are water compatible (Kobayashi et al J. Am. Chem. Soc. 1998, 120, 8287-8288; 
Fringuelli et al Eur. J. Org. Chem. 2001, 2001, 439-455; Li et al Organic Reactions in Aqueous 
Media: Wiley and Sons: New York, 1997) and are known to be catalyzed by metals. As Heck 
coupling substrates both electron deficient and imactivated olefins will be used together with aryl 
iodides and aryl chlorides. Heck reactions with aryl chlorides in aqueous solution, as well as 
room temperature Heck reactions with non-activated aryl chlorides, have not yet been reported to 
our knowledge. Libraries for Heck coupling catalyst evolution use PdCh as a metal source. 
Hetero Diels-Alder substrates include simple dienes and aldehydes, while aldol addition 
substrates consist of aldehydes and both silyl enol ethers as well as simple ketones. 
Representative selection schemes for Heck coupling, hetero Diels-Alder, and aldol addition 
catalysts are shown in Fig. 57. The stringency of these selections can be increased between 
rounds of selection by decreasing reaction times, lowering reaction temperatures, or using less 
activated substrates (for example, less electron poor aryl chlorides (Littke et al J. Am. Chem. 
Soc. 2001, 723, 6989-7000) or simple ketones instead of silyl enol ethers). 
[00235] Evolving Non-Natural Polymers: Diversification and Selecting for Stereospecificity 
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* [00236] Following each round of selection, active library members are amplified by PGR with 

for desired catalysts. These libraries are truly evolved by introducing a diversification step 
before each round of selection. Libraries are diversified by random mutagenesis using error- 
prone PGR (Caldwell et al PCR Methods Applic. 1992, 2, 28-33) or by recombination using 
modified DNA shufiEling methods that recombine small, non-homologous nucleic acid fragments. 
Because error-prone PCR is inherentiy less eflScient than normal PCR, error-prone PCR 
diversification will be conducted with only natural dATP, dTTP, dCTP,- and dGTP and using 
primers that lack chemical handles or biotin groups. The resulting mutagenized products are 
then subjected to PGR translation into non-natural nucleic acid polymers xising standard PGR 
reactions containing the non-natural nucleotide(s), the biotinylated primer, and the amino- or 
thiol-terminated primer. 

[00237] In addition to simply evolving active catalysts, the in vitro selections described above 
are used to evolve non-natural polymer libraries in powerful directions difficult to achieve using 
other catalyst discovery approaches. An enabling feature of these selections is the ability to 
select either for library members that are biotinylated or for members that are not biotinylated. 
Substrate specificity among catalysts can therefore be evolved by selecting for active catalysts in 
the presence of the desired substrate and then selecting in the same pot for inactive catalysts in 
the presence of one or more undesired substrates. If the desired and undesired substrates differ 
by the configuration at one or more stereocenters, enantioselective or diastereoselective catalysts 
can emerge from rounds of selection. Similarly, metal selectivity can be evolved by selecting for 
active catalysts in the presence of desired metals and selecting for inactive catalysts in the 
presence of undesired metals. Conversely, catalysts with broad substrate tolerance can be 
evolved by varying substrate structures between successive rounds of selection. 
[00238] Finally, the observations of sequence-specific DNA-templated synthesis in DMF and 
CH2CI2 suggests that DNA-tetralkylammonixun cation complexes can form base-paired 
structures in organic solvents. This finding raises the possibility of evolving our non-natural 
nucleic acid catalysts in organic solvents using slightly modified versions of the selections 
described above. The actual bond forming and bond cleavage selection reactions will be 
conducted in organic solvents, the crude reactions will be ethanol precipitated to remove the 
tetraalkylammonium cations, and the immobilized avidin separation of biotinylated and non- 
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biotinylated library members in aqueous solution will be performed. PGR amplification of 
selected members will then take plafce as described above. I'he successful evolution of reaction 
catalysts that function in organic solvents would expand considerably both the scope of reactions 
that can be catalyzed and the utility of tiie resulting evolved non-natural polymer catalysts. 
[00239] Characterizing Evolved Non-Natural Polymers: Libraries subjected to several 
rounds of evolution are characterized for their ability to catalyze the reactions of interest both as 
pools 'of mixed sequences or as individual library members. Individual members are extricated 
firom evolved pools by ligating PGR amplified sequences into DNA vectors, transforming dilute 
solutions of iigated vectors into competent bacterial cells, and picking single colonies of 
transformants. Assays on pools or individual sequences are conducted both in the single 
turnover format and in a true mxiltiple turnover catalytic format. For the single tumover assays, 
tiie rate at wliich substrate-linked bond formation catalysts effect their own biotinylation in the 
presence of free biotinylated substrate will be measured, or the rate at which biotinylated bond 
breakage catalysts effect the loss of their biotin groups. Multiple tumover assays are conducted 
by incubating evolved catalysts with small molecule versions of substrates and analyzing the rate 
of product formation by tic, NMR, mass spectrometry, HPLC, or spectrophotometry. 
[00240] Once multiple tumover batalysts are evolved and verified by these methods, detailed 
mechanistic studies can be conducted on the catalysts. The DNA sequences corresponding to the 
catalysts are revealed by sequencing PGR products or DNA vectors containing the templates of 
active catalysts. Metal preferences are evaluated by metalating catalysts with a wide variety of 
metal cations and measuring the resulting changes in activity. The substrate specificity and 
stereoselectivity of these catalysts are assessed by measuring the rates of tumover of a series of 
substrate analogs. Diastereoselectivities and enantioselectivities of product formation are 
revealed by comparing reaction products with those of known stereochemistry. Previous studies 
suggest that active sites buried within large chiral environments often possess high degrees of 
stereoselectivity. For example, peptide-based catalysts generated in combinatorial approaches 
have demonstrated poor to excellent stereoselectivities that correlate with the size of the peptide 
ligand (Jarvo et al J. Am, Chem. Sac. 1999, 727, 11638-11643) while RNA-based catalysts and 
antibody-based catalysts frequently demonstrate excellent stereoselectivities (Jaschke et al Curr. 
Opin, Chem. Biol 2000, 4, 257-262; Seelig et al Angew. Chem. Int, Ed. Engl 2000, 5P, 4576- 
4579; Hilvert, D. Annu, Rev. Biochem. 2000, 69, 751-93; Barbas et al Science 1997, 278, 2085- 



80 of 88 



wo 02/074929 




PCT/US02/08546 



92; Zhong et al Angew. Chem. Int, Ed Engl 1999, 38, 3738-3741; Zhong et al J, Am. Cheiru 
Soc 1997, 119, 8131-8132; List et dL Org. Lett: 1999, 1, 59-61) The direct selections for 
substrate stereoselectivity described above should further enhance this property among evolved 
catalysts. 

[00241] Structure-function studies on evolved catalysts are greatly facilitated by the ease of 
autonaated DNA synthesis. Site-specific structural modifications are introduced by synthesizing 
DNA sequences corresponding to "mutated" catalysts in which bases of interest are changed to 
other bases. Changing the non-natural bases in a catalyst to a natural base (U* to C or A* to G) 
and assaying the resulting mutants may identify the chemically important metal-binding sites in 
each catalyst. The minimal polymer required for efficient catalysis are determined by 
synthesizing and assaying progressively truncated versions of active catalysts. Fiually, the three- 
dimensional structures of the most interesting evolved catalysts complexed with metals are 
solved in collaboration witli local macromolecular NMR spectroscopists or X-ray 
crystallographers. 
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Claims 

1. A method of syntheiszing one or more chemical compounds, the method comprising Hie 
steps of: 

providing one or more templates, which one or more templates optionally have a reactive 
unit associated therewith; 

contacting 6ne or more transfer units having an anti-codon and reactive unit with said one 
or more templates xmder conditions to allow for hybridization of the one or more anti-codons to 
the template, and reaction of the reactive units. 

2. The method of claim 1, wherein a portion of the one or more chenucal compounds 
contain an anti-codon comprising a nucleotide sequence which hybridizes with one or more 
nucleic acid templates. 

3. The method of claim 1, wherein a library of more than one compound is synthesized, the 
method further comprising: 

probing tibe library to detect a library member comprising a reaction product of the 
reactive units displaying a desired property; 

detecting structural information about the reaction product; and 

iterating the method with one or more additional species to produce a new generation of 
reaction products, at least some of which display the desired property. 

3. The method of claim 1, wherein the chemical compound syntheiszed is a compound other 
than a nucleic acid or nucleic acid analog. 

4. The method of claim 1, wherein the chemical compound is an unnatural polymer. 

5. The method of claim 1 , wherein the chemical compound is a small molecule. 
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6. The method of claim 1, wherein the chemical compoimd is a small molecule and the step 
of contacting comprisies contacting with two or more transfer imits in a sequential manner- 

7. The method of claim 1, wherein the template is a nucleic acid template. 

8. The method of claim 1 , wherein the template comprises DNA or RNA. 

9. The method of claim 1, wherein the template comprises DNA. 

10. A library comprising one or more chemical compounds wherein each of the chemical 
compounds is bonded to an amplifiable template whose nucleotide sequence is informative of the 
structure of the chemical compounds. 

1 1 . The library of claim 10, wherein the library of chemical compoimds comprises a library 

-> 

of small molecules. 

12. A method for the synthesis of a library of chemical compounds, the method comprising 
the steps of: 

providing one or more templates optionally associated with one or more reactive units; 

contacting the one or more templates simultaneously or sequentially with one or more 
transfer units comprisuig anti-codon units associated with one or more reactive units under 
condition suitable for hybridization of the anti-codons with the template and reaction of the 
reactive units to produce a plurality of different library members. 

13. The method of claim 12, wherein a portion of the compounds contain an anti-codon 
comprising a nucleotide sequence which hybridizes with one or more nucleic acid templates. 

14. The method of claim 12, the method further comprising: 

probing the library to detect a library member comprising a reaction product of the 
reactive units displaying a desired property; 

detecting stmctural information about the reaction product; and 
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iterating the method with one or more additional species to produce a new generation of 
reaction products, at least some of which display the desired iproper^^ 

15. The method of claim 14, wherein the desired property comprises binding to a target 
protein. 

16. The method of claim 14, wherein the desired property comprises catalyzing a chemical 
reaction. 

17- The method of claim 14, further comprising the step of isolating the one or chemical 
compounds. 

1 8. The method of claim 14, wherein the chemical compounds synthesized are small 
molecules. 

19. A method for the synthesis of one or more unnatural polymers, the method comprising 
steps of: 

providing one or more nucleic acid templates; . 

contacting one or more transfer units with the one or more nucleic acid template imder 
conditions to allow for hybridization and reaction to form bonds between adjacent monomer 
units lined up along the template. 

20. The method of claim 19, wherein the method further comprises the step of isolating the 
unnatural polymer. 

21 . The method of claim 1 9, wherein a portion of the unnatural polymers contain an anti- 
codon comprising a nucleotide sequence which hybridizes with one or more nucleic acid 
templates. 

22. The method of claim 1 9, the method further comprising: 
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probing the library to detect a library member comprising a reaction product of the 
reactive xinits (Usplaying a desired property; 

detecting structural information about the reaction product; and 

iterating the method with one or more additional species to produce a new generation of 
reaction products, at least some of which display the desired property. 

24. The method of claim 23, wherein' the desired property comprises catalyzing a chemical 
reaction. 

25. The method of claim 12 or 19, wherein die nucleic acid template is DNA. 

26. The method of claim 12 or 19, wherein the nucleic acid template is single-stranded. 

27. The method of claim 12 or 19, wherein the nucleic acid template is double-stranded. 

28. The method of claim 12 or 19, wherein tiie nucleic acid template is selected from the 
group consistiag of DNA, RNA, a hybrid of DNA and RNA, a derivative of DNA, and a 
derivative of RNA. 

29. The method of claim 12 or 19, wherein the anti-codons comprise 2, 3, 4, 5, 6, 7, 8, 9, or 
10 b2ises. 

30. The method of claim 12 or 19, wherein the anti-codon does not allow frame-shifdng. 

31. The method of claim 12 or 19, wherein the anti-codon is selected from the group 
consisting of DNA, RNA, a hybrid of DNA and RNA, a derivative of DNA, and a derivative of 
RNA. 

32. The method of claim 12 or 19, wherein the anticodon is associated with the monomer unit 
through a covalent bond. 
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33. A method of evolving a library of compounds, the method comprising steps of: 
isolating a cdmpoimd with a desired activity attached to a nucleic acid template that 

encoded its synthesis; 

mutating and amplifying the nucleic acid template; and 

synthesizing a new library of related compounds using a mutated and amplified nucleic 
acid template. 

34. The method of claim 33 further comprising steps of: 
assaying compounds for desired activity; and 

repeating steps in claim 33 based on template of polymer with desired activity. 

35. The method of claim 34 wherein the desired activity is selected jfrom the groups 
consisting of catalytic activity and binding activity. 

36. The method of claim 34 wherein the step of mutating and amplifying is performed using 
the polymerase chain reaction. 

37. The method of claim 34 wherein the step of mutating and amplifying is performed using 
error-prone PGR. 

38. The method of claim 34 wherein the step of mutating and amplifying is performed using 
in vitro homologous recombination (DNA shuffling). 

39. A library of chemical compounds comprising one or more chemical compounds wherein 
each of the chemical compounds is bonded to an amplifiable template whose nucleotide 
sequence is informative of the structure of the chemical compoimds, and wherein the library is 
synthesized according to the method of claim 1 , 12 or 1 9. 

40. The library of claim 39, wherein the library is a library of small molecules or tmnatural 
polymers. 
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41 . A kit comprising one or more nucleic acid templates and one or more transfer miits. 

42. The kit of claim 41 further comprising buffers, heat-stable DNA polymerase, nucleotides, 
and restriction endonucleases. 

43. The kit of claim 41 wherein the nucleic acid template is associated with a small molecule 
scaffold. 

44. The kit of claim 41 comprising a plurality of nucleic acid templates. 

45. The kit of claim 41 wherein the transfer units comprise monomer units. 

46. The kit of claim 41 wherein the transfer units comprise reactants to be used in modifying 
a small molecule scaffold. 
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candidate molecules 
natural products, 
synthetic molecules, 

pro/elf tM, nucleic acids 



, molecnies with 
desired binding or 
catalytic properties 



ttviLxfttiion i\f 
injiir/ififfiiw into 
tiwtacular 
s/mcfxtrey 




mtqtli/icatitm (and 
iHversIfia^tiiO of 
Infinmtian 



mformution encoding 
molecnies with desired 
properties 




struct bre 
determination or 
tag decodinet i^A'.-i 



chemical approach 
AVifuffeS approach 



structure-activify 
relationships 



1/57 



wo 02/074929 



PCT/US02/08546 




2/57 




3/57 



wo 02/074929 



PCT/US02/08546 



-C— Z— Z— C — 



X=A,T,orC 
Z=T,A,orG 



X=A,T,orC 
Z=T,A,-orG 



.„C_Z— Z-C-C— Z-Z-C-C— Z— Z- , , , , 

I ::::!; i ::::: '• 
d-^-X-e G-X-X-G G-X-X-G 



NHz 



I I 



HOillj (1h2 HO2C HOji 



...c_z— z— c— z— z-np— z— z— c— z— z— c--^--^- 

ijj f-M <^-M ^-H 



Ah. HO,<ij Ah, ho^ Ah, ho^ i^ih, ho,c nh, ho^ 
A quadruplet and triplet non-jrameshiftmg codon set. Eachpravides 9 possible codons. 
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"h^x^yj^ biotin »»» avidin - 
biotin-terminated 



-substrate 




bond-cleavage 



catalysis ^ 

''vu^;,j» biotin '•" a\n'clin product 




substrate 2 




bond-formation ; 
catalysis ^ 




biotin avfdin product 



biotin «"»avidin~ substrate 1 
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1SH reagent 



^^^^^).>H regent 




H template 

template: EHEH--EEHHEEHHEEHHEEHH 

#ofrBagenl qo0303030303 03 0. 303 03 

mtemaiches: ^ ^^^^ i,^ 

temo/afes-^ ^ ^ ^^^^ "f^^ ^^^-J^."^ 



reagents ffl ;^ j§» i?^^' ^ '^""-^ 



5 mln 



template: HHHHHHEEEE EEBEE E E E 
nudfiophfle: SSSSSSSSSSSSSSNNNN 
magent M X M XMXMX.MXMXMXMXMX 




SlAe SBA^ SIA SMCC 6MaS BMPS SySB SMCC SVS8 



Teoiprals-g' 



S 5 " SMCC BMPS 

T jjA^ 



S8AP 
O 



A^l " GMB3 
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G-C-^ h-C-A-T'G-C-T-T-A-A-5' 



# of mismatches: 




am6al2S*C;' 1 mln* " Sniln 



.„,^o« 0 13 0 

temp for leh: ^9 c '-^ — 



Figure 7^ 
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J Of mismatches: 0 3 0 3 0 3 
i products — ^ 



03030303j2 



backbone: ^ « CAB>. (C3). (HG)s (Hc,. (HC)e 
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'"Lk-x-x-x-x-x-x-x-x-x-s » 

*-C-G-»-A--t-T-C-G-A-C-T-C-G-G-G. - . 3» 

n bsses 

(reagent] = [templalej 



60 



120 




-^n^l, lis nM 
— n«f,2SnM 

-*-n-1, 125 nM 



30 

time (min) 
-M»10, iz5nM 

^n"10, 62.5 nM 
-*.n°10, 125(1M 

Figure 10 



mis, 12.5 nM 

-».infs,62.SnM 
-r-mte, 125 nM 
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O'"^^^0 .j^^TGGTGCgCftCCCCCC SMCHjtftCTN CJiTACCXCCTCCGAGC^ 

O biqtrn 



1,02S total 
slarUng 
materials 



templalB-dlrected 
translaUon of ONA 
nbraiy Into synthetic 
coinpounds 



) 



HS C ACTGCCCA C-S ' onereagon! I 1,025tolal 
' reagonts 



blotln^ 



OHB predud 



[fina4NSATACCACCrcx:GAGCCGA6GlVGCaS-3> 
of 1^4preduct3 

1] in vitro selection wtlh 
streptavfdin beads 

2) PGR ampUncatlon 
of selected products 

5 '"TCGTGCCGAGCCGCC G?? ??? 7?? GATACCACCTCCCAGCCG>gCAGeCB-3 ' 

DMA oicofSng selected and amplified molecules 

Icharacterfze by DNA 
sequencing and cGgesthm 

S ' -TGCTCCCCACCCCCC CTGACCGCT <aTACCACCTCCGACeCBUMA<a:cs.a « 

Figure 11 



1,025 
presumed 
products of 
" 1,050.625 
ttieoraticai 
products 



prunaiy product 
(I.OOO-fold 
anrichmerd) 
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2 3 4 5 e 



(b) 



J nbnuy 




4t 

' I ttmoalnoxr 

I magBntlStnuy ^ 

4e' 



react, lapsal 
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^ Sl candmons product yield {%) 

1 2 T o 



1 3 



n 10 



17 IS*^*^ 




18 15"'"^'^ o"^"" ° ^ 



44 

18 i« IM^ f^^g 




20 ^ 19 ' 




'=V^ 31 
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reaotion: 1 + 3 4 + 5 10 + 11 11 + 13 12 + 15 18 + 19 

matchedness: MXMXMXMXMX M __X^ 

products ► 

templates 
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5'vvwvvs>B 



DNA-tempteted 
iO bases amide bond 

formation ^ 3«x/sAwA^prcxlucl-vsA<'5' 



a 

4 
4 
4 
4 

5 ° 

O 

H = 
S ° 



6 " 



6 



7 " 

4 



OH 



8 



product 



Ph 




79.59 



73,54 



79.46 



81,62 



58,66 



.47.64 



58,53 



56,71 
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(a) 



.2 
E 

p o 
a> 



rate of bond 
formation '\ 




distance 
independent 
regime 



n bases 



rate of 

template-reagent 
annealing 



n 



(b) 



n^O n^O n^O n-10 n=10 n=10 



product — ► 
template — 1^ 



time (min): 5 60 780 5 60 780 
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X + ^^^^ R I i p 
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aoarless nnkar 



useful scar linker 
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autoclenved tinker scarless tinker useful scar tinker 

o 




^ ^ Nal04, H2O ^ 

^^^^ temptela-w^^N'^Y'^Y^" 



R O 
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reagen t ■ "■'-SH 
DCC 



•a 




AgCOaCFa . 



reagent R-^^lempIate R^'^Nh^^'-^^MenpIate 



'OH 



SH 



^ Butoclea ved linker 



R-*«^tenriplate 
useful scar linker 
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DDAB-Templaler-N' 



'"^-^^ 



O DCC,NHS 



DCAB*Reage 
DDAB = MB2(rhCi2l-<2^zN^ 

template: + none 
reagent matchedness: none X 
preanneated In water?: 
product 

tompistB - 
reagent - 



^ Temptate— N 

cH2a2 " 




+ 


+ 


none 


+ 


X 


X 


M 


M 




a 




Y 
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Start Monomer 



I 



o-NBOC 
I 





covalendy-fntact template ONA strand 

C itiiiiniiiG^ Starting string of dC's provides 

good jniliatton and a site for 
'V PGR priming with oligo-dG 



QllliliiliiiQ 




Photocaged primary amine 
prevents premature 
injttatlon of cart>amat9 
polyi%riza15on 



HO 



Me \ 



O 



TBSO, 




lltniinnB)* 



Every 2 nucleotides 
encodes one dfcarbamate 
"monomer"; this provides 
14. functional codons. 
1 start codon, 1 stop 
codon 



Extend Monomers 




\-saO Ms ) 



5^ Hlltdlllt B2* 



?^ >-/ 




B4 iiitiniib B4' 



HO 



sO Me 
o^^^ ■'x ^^^^^^ 




A>><"i'iiii T 



Stop Monomer 



photodeproiect 

biopolymer a 

polymerization n 



Bs \ / 

y«o 




MeO 
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Start Hairprn: covalently j 
linked to template I 



Neil cleaves 
exclusively here 



Neil does not deave 
phosphorothloate 




encoding DNA suitable for PCR, 
sequencing, etc 

3' OhPii"l»»"»i"""> GGGCCXnGGC-5' 
+ PNA 



digest with NcJ\ 



y OH'"«"«»"«"«"«"i-vvwvvxAA/'biotin ^vidin purify 
encoding DNA sidechain-bearing — 
PIMA * 



Components of an amplifiable, evolvable functionalized peptide nucleic acid library. 
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T— G— C 



— G— C— 



C-^C — G 



solvent 




_T_C— T^A— A— A— G— A— 6— t;^ ^ r 

T T T • C 



DNA' 
hairpiii 



coupling reagent, 
leaving group 



PNA Q 



Oregon Green^ t * ^/ ^ 

^X«X-i T-C V— G-C-G-C-(y^| 
:_C— G— T— C — T—A— A— A— G— A— C— G— 6— G— \^ 

Test reaction used to optimize reagents and conditions for DNA-templated PNA coupling. 
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Two schemes forihe selection of a biotin-terminatedfLinctionalizedPNA capable of catalyzing 

an aldol or rettoaldol reaction. 




' DNA template-directed synthesis of a combinatorial small molecule library. 
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H H H H 

O (^0° (\>^ S'-NWa 

T T T C \ 

-T^— A— A— G-fl^C^--G]-C--G— C 

J " ' 

anneal, measure rate of acylatioii 

\ PNA 




• ■ ■ I 

• • i 1 r 

^^-^vv'vvsa^vvvnaq — C— G — T — C — T^^— — A — 



Measuring the rate of reaction between a fixed nucleophile and an electrophile hybridized at 
varying distances along a DNA template defines an essential reaction window in which DNA- 
templated syntliesis of nonpofymeric structures can take place. 
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autocleaving linker 



template 



scarless linker 

g H 7 " 

I |^N««*wtemplate 



autocleaving linker 

S^(Chis)^HC0(CH2)sNHR 
I 4 



yAgCOzCFa 

FyMHCCHzlsCOHNCCtyf^WH 

5 . lemptale 



useful scar linker 



H 




PhHzC 



10 



template 



pH11.8 



9 O 



11 

Nal04. pH 5.0 



tempial 



12 




SI 
CO 

E 

(DU3 

W 



•a 

i 



8 



o 

CO 

> 

CO 

13 



T3 O 

® to 



^ r ^ 

unstained (dansylated 
species only are visible) 




i"0 



stained with ethidiunj bromide 



'33 
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molecul 



J \ mc3lecul& 

Y 



tamplaiB 



product and 
unreactBd template 
venter the next step 



produci^ 



DNAtempbted 
synthesis usng an 
autodeavinglihler 




■I- blofin ^"wa^ 



molecule^ 



DNAtempbted 
^thesis usfng 
a traoaless linker 
or useful scar tin kar 



el ute product by 
■inter cbavage 




jre 
v\^mavi din- 
In kBd beads 



teagent 
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bloti|i 

3 • -GCGXXXXCCCGAGGiTT'^NHa-S' 



J)EDC,siilfi>-NHS 
2)NH40H 



3 • -GCGXXXXCCCGACGTT-^NB £ 

1 

tempi ate pool | 

-3 1 .GCOXXXXCCCGAGGTT^^NH £ 

-CCGXXXXCCGXXXXGCCXXXXCGCXXXXGaOCTCCAA-3' 

DreiiiDNApolymc 
2) denature stxaaAs, purify with 

avidinmagneticbeads | ^^^^^ X^NH, 

»-CGCXXXXGGCX3aXC603O^ £ 

•ncadej encodes Mcoito* eneo** ^ 
R« «j Sj.- 
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o A nagwtObmy R o 




AC|0 (capping) 
~ I1.S 



1) pH3, 

2) £DCsuiro-NHS 



R4 a 
EDC, stUro-NHS 
•l)Ac,0.tbeii pHlLS 



templi 



tempi attt'^'^'^^l 



Ri O 



AeHHr^R4 



Stmetuns of nageat Obfarits: 
5 -GCCXXXXCGC-anlcep-aimno add, 
S'^CXXXXCCCMinker-amlno add, 
5'-GCGXXXXCCG-UnhefMimlB0 add 4 
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1 



6'4eagBiTtoHgo-'v%^N' O ^ - v O-NHS 



I 

M^^o'^^^'^ ^ or ^0 

y-ioagentoHgo'v^^fj^ O ° fj T 

S'^TBageiMtlAsUoke^tfrfno add 
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template 
bases 21-30 

EDC. sulfo-NHS I Hl^ 

DNA-tempIated ^-OH 

amide formation I o 

(step1) Y 



H 

capture with avidln-linked 
beads. etutewfthpH 11.8 
buffer 



14 , & ^ 

anneal second reagent 



1' 



template 

Moth (3 

>OH 

anneal ihird reagent 




template 



3) 
len 

pH 11.8 buffer - 



1) DMT-MM (step 3) ^^f^OH 

2) ayidin beads, then | 

OH 



16 
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bases 21-30 



btotin .Hhh^ OH 



(slep 



1^ OH 

i capture with 
streplavidin beads, 
elutewimNaI04 



I 



anneal second reagent 



template 



bfoUh HN-\ 



1) DNA-lempIaledWI«lg \5>N*-N^Na.rfa«ij 
oleflnaUon(slep^66%) Ftr%J[ NH^Iansyl 

2) wash With avldtn beads 1 




H 

[J'^'^'dansyl 



21 



anneal thInJ reagent 



bloUn 



bases <-fO « m 



HI 



o o o o 

DNA-tempIaled conjugate 
addition (step 3, 75%) 




dansyl 



capture with 
slreptavtdin beads, 
Y elutev^lthpH 113 buffer 



24 " ^=«<CH^H2 
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blotin 

3 '-GCXCtSECGTOGTCGqcogceGaQgoocGGSogc^ Hbraiy 



Ra R2 Rj 
in ^dtro setection 

tfotln 

3 ' -■<;cTCGTarrGGTCGCGTG^GGCTAGca:GQACGTGCGQATGa:a^^ molecule 

IFCRirtth S'-CGAGCAGCACCAGCG-3' and 
3'-GCGXXXXCCCGAGGT(bio{in)T-NH-aminoac{d 
biotln 

3 • -KKnxiGTCGTGGTOscGTgvGGC TAxsGC GG AasTO ca oaflTO Cca material 

5 ' -CGACCAGCAOCAGCGCACTCX:6AICOCCC7OCA0QCCTACGGGCTCCAA ()naiiy COf^s) 



DNA-teaplat«d»y.tl.oi» [_ ^_ ^ w^nlteb, 
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^'^f '^l^ '^SK**^ 



temp/atooncotffng 3»-oc«c<sifcaTa<jrcccoTOAoocb«A.:-TAOOCCG.joccrACJTGCGCT^ 

panntmoleCUtO 1 S • - CGAGCACCACCAaCOCACTCCOCCTOnATCCOCC'rCaGCTGCACOCOACT C CTACOOCCT CCAX- 3 



meooiu «ncams wtodhm wi^-- ^t^.,^ 

R< R» Rt_ ftetfp 

toimfatD eneorfbisr 3 • -acTcoicoiaaTCCCicAGaaccaA •2cccTAC0oafiC(:!»cccococT«A«a«a«coA^ 

fWranf mofoeuto 2 5 • -C6AOCIVOCACCAeCOXQTCCCCCCTOaOOATOCCCC«OC»aOGCaCOACTCCWVCOOGC«CCAX-3 • 

digest with ^961 aod HinVl 



3 •-OCTCGYCGTOOTCGCOXOAOaCG 
5 * -COAGCAGCACCAGCaCACTCCOCCtG oCCCACCTGCOC 

GTceiwcocaAcr 

GACeTJkGOCOOO ioagoatgcccoaoott-hh—R 
3*-OCVCaTCG!raO«CGCTCA9GaCS OATCCSCCCCGO CCTACOaGC7CCAA-3' 

S • -CGAGCAGCACCAGCOAGTCCCOCCtG ''""^^'^^'^..j.cACCCOCOC 

nTGOGCOCGACT TOAOOTTOCCCOAGGTT-BH— R 
<IACCCCXAC6aO CC AAC GOGCT CCAA- 3 » 

GG0A36CCCC6G 

I T4DNAngnse 

3»-eCTCO»COTOG»CaCOTGKOOCaGACCCCTACSa«USCeACCC6COCTGAOaA'IOCCCGAOOTT-BM--R 
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R r 2'-deoxyribose-5'-triphosphate 
R2- 



SH 

COOH 



A 
OH 
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if 



_ PdCl2,NaOAc buffer, ^, 
jj N-allyltrifluoroawtamide j |] 



22 



58% 

R-2'-deoxyribose 




23 



1) POCb, proton sponge, 1 
iHcocFa trimethyl phosphate ^ NH'''^Y'^*''^H2 

2) tri-n-butylammonium cr^Tjf^ R'=s2'-deoxyribose 
pyrophosphate, DMF S'.triphosphate 
-^^NH* 7 
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27 



25 26 24 O cfiovL Tl I 

RaneyNI 9 CI 

^ ccT ^ c6 



DRaneyNI CI t pl SH 

N-iodosuccinlmfde 



2)0. 

91 % H 

28 29 30 



91% fi^ fi^N^ 
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NH2 
R 

R = 2'-deoxyiibos8 



Br2.cat.ScCb Bn 
60% 



H 

p.. 



NH2 



a, b, 0. d, e, or f 
20-84% 



NH2 NH2 



31 



R 

32-37 



1 

R* 

R' = 2*-deoxyrfbose- 
S'-trfphosphate 



a (R = Me): 1) HMDS, dtoxane, 2) Me4Sn. Pd(PPh3)4, NMP, 3) K2CO3, MeOH 
b (R a Et): 1) HMOS, dloxane, 2) Et4Sn, Pd(PPh3)4, NMP. 3) K2CO3, MeOH 
- ■ ■ * , <iioxane, 2) (CH2'=CH)4Sn, Pd(PP 



C(R 
d(R 
e(R 



'CH2=CH2):1)HMDS. 
«NHMe):MeNHzH20 
*NHei):ElNH2,H20 



l)4Sn, Pd{PPh3>4. NMP, 3) KaCOa. MeOH 



f (R«hlsfamlnyl): his 



Eton, heat 
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Tod' 38 
NaH 



V^f NHa/MeOH, 



65° C 



30 



-NH2 cAcFa 
ho'' 39 HO 40 




n 1)HCl/MeOH 



3)p-To!uoyl-CI j^^^' 



Me HaAcOH 



TolO^ / 
Tol</ 38 
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H2N^ 



NHz 



41 R' 42 R 

R' B 2'-<leQ)Qtfibose-5-irtDhosDhate 



NH2 
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20 or 40 ramiom bases 

-ACGIUteCCGCS ZC6CVinn»)NNin«SNHmiXiXI2»2NCCGTCASC6AG CCCT- 3 ' 

synthetic template ilbrary 



P NA p olymerase 
dCTP,dGTP 
a noQ-natnraJ dA*TP or d ATP 
nucleotide dU*TP or dTTP 



3 • -GGCAGTAGCTC6 GGAT'«'^Ba- 5 ' 
5 * -TACGSAGCGGC6TCGC- 3 * 
biodn 



3 • "ATGCAICGCCGCAGCGWKllWNinnjiyKiyfeaCTKiniO 

5 • -XAC GTAGC StSCGT CSCNKNBmKHNNNimNinnnraSSCC GTCAX CGA6C CCIA- 3 • 

* * # ** 

denature strands, remove 
undesired strand with 
avidin magnetic beads 

3 • ^A^GCiiscGccGCAGcemmhnmxtJsnsm&mmjmv^ ' 
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NHS-O'^ubstnitei -C^^ ° 



NHS-O'^ubstnitei -CQp^ ^* 

fBond Forinatioa Selection I *4 ■ 



amplify by PGR, substralej-WpUn 
diversily into next » 
generation library I 



1' 



capture active VsubstiatB, 
D catalysts with 

O immobilized 

.-NiiHluct-biotlh , 

O J^ft. acthrecafa/yst 
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9 btQtfn 
rtS-O-'Ssubslrat/ 



amplify by PGR, 
diversify into next 
generation library 



substrate^ 
teotin J 




O blotin 
I'Ssubslrat/ 



I Bond Qeavage Sdection | 



remove inactive 
catalysts with 
immobilized 
avidin 



bloUn 



pradi 



tdotin 



active eatafyst 
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DNA fibraiy vfith metM 
blnOnggmupsn 



(e.g. PdCfe) 



b!otln-N^t 



acBve Hack 
ooupilng catalysts 



select acUvs catelysta 
wlih bnmobnized avfdin 



metal source 
<e.g.Sc(OTIj3) 



bioGn-Nhb 



l8-oA»-^CHO 



hetero 
Dlels-AIder 



acj^ftjggp vrimrmmobillzBd avfdin 



metal source 
(a.g,Yb(OTna) 



NHS-cr^^*-^CHO 

WoUn-NHi Wot! 



aJdo) 
addition 



acdve aldoi 
aOdsUon catalysts 



select active calafysts u 9^ 9 m 

with hnmotMlteed avfdin biolIn4l,y''v,,A^,A^x-Y^ 
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